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Preface 


This book is based on a graduate course taught four times, once in French at 
the Université de Montréal and then three times in English at the Institut 
fiir Theoretische Physik, in Innsbruck, Austria, at the Center for Quantum 
Spacetime, Department of Physics, Sogang University, Seoul, Korea, and most 
recently, a part of it at the African Institute for Mathematical Sciences (AIMS), 
Cape Town, South Africa. 

The course covered the contents of the magnificent Erice lectures of Coleman 
[31], “The Uses of Instantons”, in addition to several chapters based on 
independent research papers. However, it might be more properly entitled, “The 
Uses of Instantons for Dummies’. I met Sidney Coleman a few times, more 
than 30 years ago, and although I am sure that he was less impressed with the 
meetings than I was and probably relegated them to the dustbin of the memory, 
my debt to him is enormous. Without his lecture notes I cannot imagine how I 
would ever have been able to understand what the uses of instantons actually 
were. However, in his lecture notes, one finds that he also thanks and expresses 
gratitude to a multitude of eminent and great theoretical physicists of the era, 
indeed thanking them for “patiently explaining large portions of the subject” to 
him. Unfortunately, we cannot all be so lucky. Coleman’s lecture notes are a work 
of art; it is clear when one reads them that one is enjoying a master impressionist 
painter’s review of a subject, a review that transmits, as he says, the “awe and 
joy” of the beauty of the “wonderful things brought back from far places”. But 
then the hard work begins. 

Hence, through diligent, fastidious and brute force work, I have been able, I 
hope, to produce what I believe is a well-rounded, detailed monograph, essentially 
explaining in a manner accessible to first- and second-year graduate students the 
beauty and the depth of what is contained in Coleman’s lectures and in some 
elaborations of the whole field itself. 

Iam indebted to many, but I will thank explicitly Luc Vinet for impelling me to 
first give this course when I started out at the Université de Montréal; Gebhard 
Griibl for the opportunity to teach the course at the Universität Innsbruck in 
Innsbruck, Austria; Bum-Hoon Lee for the same honour at Sogang University 
in Seoul, Korea; and Fritz Hahne for the opportunity to give the lectures at 
the African Institute for Mathematical Sciences, Cape Town, South Africa. I 
thank the many students who took my course and suggested corrections to my 
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lectures. I thank Nick Manton, Chris Dobson, and Duncan Dormor, respectively, 
Fellow, Master and President of St John’s College, University of Cambridge in 
2015, for making available to me the many assets of the College that made it 
possible to work uninterrupted and in a pleasant ambiance on this book, during 
my stay as an Overseas Visiting Scholar. I also thank my many colleagues and 
friends who have helped me through discussions and advice; these include Ian 
Affleck, Richard MacKenzie, Eric Dupuis, Jacques Hurtubise, Keshav Dasgupta 
and Gordon Semenoff. 

I especially thank my wife Suneeti Phadke, who started the typing of my 
lectures in TeX and effectively typed more than half the book while caring for 
a six-month-old baby. This was no easy feat for someone with a background 
in Russian literature, devoid of the intricacies of mathematical typesetting. This 
book would not have come to fruition had it not been for her monumental efforts. 

I also thank my children Kiran and Meghana, whose very existence makes it 
a joy and a wonder to be alive. 


1 


Introduction 


This book covers the methods by which we can use instantons. What is an 
instanton? A straightforward definition is the following. Given a quantum system, 
an instanton is a solution of the equations of motion of the corresponding classical 
system; however, not for ordinary time, but for the analytically continued 
classical system in imaginary time. This means that we replace t with —77 in the 
classical equations of motion. Such solutions are alternatively called the solutions 
of the Euclidean equations of motion. 

This type of classical solution can be important in the semi-classical limit 
h — 0. The Feynman path integral, which we will study in its Euclideanized 
form in great detail in this book, gives the matrix element corresponding to the 
amplitude for an initial state at t = t; to be found in a final state at t=ty asa 
“path integral” 


(final, t|initial,t) = (final, tpe #4 -*)*@-P) linitial, t;) 


final,t ; 
-f Í DpDgek Strahl.) (1.1) 


nitial,t; 


where Alâ, p) is the quantum Hamiltonian and h(q,p) is the corresponding 
classical Hamiltonian of the dynamical system. The “path integral and 
integration measure DpDq defines an integration over all classical “paths” which 
satisfy the boundary conditions corresponding to the initial state at t; and to 
the final state at tp. It is intuitively evident, or certainly from the approximation 
method of stationary phase, that the dominant contribution, as A > 0, should 
come from the neighbourhood of the classical path which corresponds to a 
stationary (critical) point of the exponent, since the contributions from non- 
stationary points of the exponent become suppressed as the regions around them 
cause wild, self-annihilating variations of the exponential. 

However, the situation can occur where the particle (or quantum system in 
general) is classically forbidden from entering some parts of the configuration 
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Figure 1.1. A system trapped in the false vacuum will tunnel through the 
barrier to the state of lower energy 


space. In this case we are, generally speaking, considering tunnelling through 
a barrier, as depicted in Figure 1.1. Classically the particle is not allowed to 
enter the space where the potential energy is greater than the total energy of the 
particle. Indeed, if the energy of a particle is given by 


12 
B=T+V=54V(q) (1.2) 


then for a classically fixed energy, regions where E < V (q) require that T = g <0, 
which means that the kinetic energy has to be negative, and such regions are 
classically forbidden. Then what takes the role of the dominant contribution in 
the limit A —> 0, since no classical path can interpolate between the initial and 
final states? 

Heuristically such a region is attainable if t becomes imaginary. Indeed, if t > 


dq\* . (.dq\* dq\* l 
—ir then > 2 = , T becomes negative and then perhaps 
dt dt dt 


such regions are accessible. Hence it could be interesting to see what happens if we 
analytically continue to imaginary time, equivalent to continuing from Minkowski 
spacetime to Euclidean space, which is exactly what we will do. In fact, we 
will be able to obtain many results of the usual semi-classical WKB (Wentzel, 
Kramers and Brillouin) approximation [119, 77, 22], using the Euclidean space 
path integral. The amplitudes that we can calculate, although valid for the small 
A limit, are not normally attainable in any order in perturbation theory; they 
behave like ~ Ke~%0/"(1 + 0(h)). Such a behaviour actually corresponds to an 
essential singularity at h=0. 

The importance of being able to do this is manifold. Indeed, it is interesting 
to be able to reproduce the results that can be obtained by the standard 
WKB method for quantum mechanics using a technique that seems to have 
absolutely no connection with that method. Additionally, the methods that we 
will enunciate here can be generalized rather easily to essentially any quantum 
system, especially to the case of quantum field theory. Tunnelling phenomena 
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in quantum field theory are extremely important. The structure of the quantum 
chromodynamics (QCD) vacuum and its low-energy excitations is intimately 
connected to tunnelling. Various properties of the phases of quantum field 
theories are dramatically altered by the existence of tunelling. The decay of the 
false vacuum and the escape from inflation is also a tunnelling effect that is of 
paramount importance to cosmology, especially the early universe. The method 
of instantons lets us study all of these phenomena in one general framework. 


1.1 A Note on Notation 
We will use the following notation throughout this book: 
metric my, = (1,—1,—1,—1) 
Minkowsi time t 


Euclidean time T 


~~ aT 
=. e e oR 
ao oO A WwW 
Mim Neat Meee Ne 


Analytic continuation of time t— —ir 


2 
Quantum Mechanics and the Path Integral 


2.1 Schrödinger Equation and Probability 


Our starting point will be single-particle quantum mechanics as defined by the 
Schrédinger equation 


htv, =ħ («. -in=) U(a,t). (2.1) 


Here h(a, —iht) is a self-adjoint operator, the Hamiltonian on the space of wave- 
functions U(z,t), where x stands for any number of spatial degrees of freedom. 
The connection to physics of U(x,t) comes from the interpretation of U(x,t) as 
the amplitude of the probability to find the particle between x and «+ dz at 
time t; hence, the probability density is given by 

P(x, + da] = U* (x,t) U(z,t). (2.2) 


Correspondingly, the probability of finding the particle in a volume V is given by 
PIV] = i daW* (x,t)U(a,t). (2.3) 
V 


The state of the system is completely described by the wave function Y (x,t). It 
is the content of a standard course on quantum mechanics to find (x,t) for a 
given h(x, —iħ&). 


2.2 Position and Momentum Eigenstates 


For our purposes, we introduce the set of (improper) states |x} which diagonalize 
the position operator X, with 


X |x) = x|) (2.4) 


and 


Janae a (2.5) 
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We are in principle working in d dimensions, but we suppress the explicit 
dependence on the number of coordinates. The states are improper in the sense 
that the normalization is 


(rly) = d(a—y), (2.6) 


where ô(x— y) is the Dirac delta function. We also introduce the set of (improper) 
states |p) which diagonalize the momentum operator P 


P\p) = plp) (2.7) 
with 


fwo (2.8) 


but as with the position eigenstates 


(p\p') = ôlp— p’), (2.9) 


where 6(p—p’) is the Dirac delta function in momentum space. The improper 
states |x) and |p) are not vectors in the Hilbert space of states, they have infinite 
norm. They actually define vector-valued distributions, linear maps from the 
space of the square integrable functions of x or p or some suitable set of test 
functions usually taken to be of compact support, to actual vectors in the Hilbert 
space, 


Ie): fe) 1p) ~ f deto), (2.10) 


where the ~ should be interpreted as “loosely defined by”. For a more rigorous 
definition, see the book by Reed and Simon [107] or Glimm and Jaffe [55]. 
The operators X and P must satisfy the canonical commutation relation 


[X, P] = ih. (2.11) 


The algebraic relation Equation (2.11) is not adequate to determine P 
completely; there are infinitely many representations of the commutator 
Equation (2.11) in which X is diagonal. Taking the matrix element of Equation 
(2.11) between position eigenstates gives 


(z —y)(a|Ply) = (2|[X,P]ly) = ih(aly) = ihd (a — y). (2.12) 


For the more mathematically inclined, this expression does not make good sense, 
since the position and momentum operators are unbounded, though self-adjoint 
operators. They may only act on their respective domains and, correspondingly, 
the product of two unbounded operators requires proper analysis of the domains 
and ranges of the operators concerned and similar other difficulties can exist. We 
leave these subtleties out in what follows, and refer the interested reader to the 
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book on functional analysis by Reed and Simon [107]. We find the solution for 
(z|Ply) as 


(2|Ply) = -ihle - y) + côl) 


= in (aly) + cO(a—y), (2.13) 


where c is an arbitrary constant, using the property of the ô function that (a — 
y)o(x—y) =0. We will call the x representation the one in which the momentum 
operator is represented by a simple derivative, i.e. c= 0, 


(| Ply) = —ih (aly). (2.14) 


In this representation, 


(elo) = f aua\Pap tule) = fay (in elu) ) Co) 


d 
= —th— ; 2.1 
in-(alp) (2.15) 
Acting to the right directly in the left-hand side of Equation (2.15) gives 
^ Led 
(z|Plp) = p(z|p) = iñ; elp). (2.16) 
The appropriately normalized solution of the resulting differential equation is 
1 Dek 
(x|p) = er, (2.17) 
(2rh)? 
where d is the number of spatial dimensions. 
2.3 Energy Eigenstates and Semi-Classical States 
We can write the eigenstate of the Hamiltonian in the form |Y g), 
where A(X, P) is defined such that 
sh tad ` _d 
WAR PIN) =A (x-in E) (el) (2.19) 
for any vector |f) in the Hilbert space. Then 
Pae ` d 
(z|h( X, P)|% g) =h («int ) (2|U g) = E(z|Y g), (2.20) 


which implies the energy eigenfunctions are given by 


Up(x) = (z|T g). (2.21) 
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Correspondingly, 


a) = f dela) (elte) = f deWe(0)l0) (2.22) 


and 


A d 

h(a, th )We(2) = EV g(x). (2.23) 
xv 

A particle described by UW z(a) is most likely to be found in the region where 

W(x) is peaked. The time-dependent solution of the Schrödinger equation for 

—,Et 


static Hamiltonians is given by Up(x,t) = Ve(x)e , and the most general 


state of the system is a linear superposition 


U(x,t)= > ApWp (ae h”! (2.24) 
E 


with 
XC ApAn=1. (2.25) 
E 


Suppose the Hamiltonian can be modified by adjusting the potential, say, such 
that W(x) approaches a delta function: 


Wp(x) > O(a — 20). (2.26) 


We would then say that a particle in the energy level E is localized at the point 
xo. But in the limit of Equation (2.26) we clearly have 


|W) — |x0) (2.27) 


from Equation (2.22). Thus the states |x) describe particles localized at the 
spatial point x. This is conceptually important for the semi-classical limit. Semi- 
classically we think of particles as localized at points in the configuration space. 
Thus the states |x) and their generalizations are useful in the description of 
quantum systems in the semi-classical limit. 


2.4 Time Evolution and Transition Amplitudes 


Given a particle in a state |W;0) = |W) at t = 0, the Schrödinger equation, 
Equation (2.1), governs the time evolution of the state. The state at t = T is 
given by ree 

IT; T) =e tn XP) yy, (2.28) 


which satisfies the Schrödinger equation. The exponential of a self-adjoint 
operator, which accurs on the right-hand side of Equation (2.28), is rigorously 
defined via the spectral representation [107]. The probability amplitude for 
finding the particle in a state |®) at t = T is then given by 


Th(X,P) 


(Dlt; T) = (dle Tr |Y). (2.29) 
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We could derive an expression for this matrix element in terms of a “path 
integral’. Such an integral would be defined as an integral over the space of 
all classical paths starting from the initial state and ending at the final state, 
and we would find that the function that we would integrate is the exponential 
of —i times the classical action for each path. This is the standard Feynman path 
integral [45, 46], which was actually suggested by Dirac [40]. 


2.5 The Euclidean Path Integral 


Rather than the matrix element Equation (2.29), we are more interested in a 
path-integral representation of the matrix element 
Bh(X,P) 
(De wy, (2.30) 
where ( can be thought of as imaginary time 


T > —iß. (2.31) 


The derivation of the path-integral representation of Equation (2.30) is more 
rigorous than that for Equation (2.29); however, the derivation which follows can 
be almost identically taken over to the case of real time. This can be completed 
by the reader. It is the matrix element of Equation (2.30) that will interest us 
in future chapters. 

First of all, due to the linearity of quantum mechanics, it is sufficient to 
consider the matrix element 


BAR Ê 
(yle TARP) a), (2.32) 


To obtain Equation (2.30) we just integrate over z and y with appropriate 
smearing functions as in Equation (2.10). Now we write 


Bh(X,P) 

h =e e -e 7 
SS 
N-+1 factors 
where we mean N +1 factors on the right-hand side and (N + 1)e = 8. Next we 


insert complete sets of position eigenstates 


where I is the identity operator. Between the evolution operators appearing on 
the right-hand side of Equation (2.33), there will be N such insertions, i.e. i: 
1— N. Consider one of the matrix elements 

eh(X,P) 


(zile ® zi) (2.35) 


ARP) ÀP) ARP) 
e Roos fe R (2.33) 


between position eigenstates |z;)and |z;-1) for Hamiltonians of the form 


h(X,P) = = TV (XY, (2.36) 
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Then 


(ziļe7 


ÎR, P 


Iziz) = (ail — £ (P?/24+ V(X) lz) +02) 


= f lai p ORAM jee asa) 


57 fo (1- h ` (p /2+V(zi-1))) (zilpi) (pilzi-1) +o(e*) 
2 Zį— Zi 
dpi (3 HV (zi—1) ini! 2 #2) +o(e7) 


2rh)?4 
ip, JE ¿PiCi ži) (===) 
~ | eave 4 
2 
E 1 sekin ve) +o(e?) 
xe (C ) , (2.37) 


where in the second step, we have inserted a complete set of momentum 
eigenstates after letting V(X) act on the position eigenstate |z;—1). The first 
factor in the last equality is just a (shifted) Gaussian integral, and can be easily 
evaluated to give 


etp -iiz 1 q 
€ = Sonne ; (2.38) 


Now we use Equations (2.37) and (2.38) in Equation (2.33), inserting an 
independent complete set of position eigenstates between each of the factors 
to yield 


24-24 2 
—8i(X,P) _ dz,---dzn hE) sven) 
(yle |r) = (nh Z II e 

TNE i=1 


ef SU 
= {Ae geet (2 (=) svenn) 
(rhe) 2 


k: 


(2.39) 


where we define z = x and zy+1 = y. Equation (2.39) is actually as far as 
one can go rigorously. It expresses the matrix element as a path integral over 
piecewise straight (N pieces), continuous paths weighted by the exponential of a 
discretized approximation to the negative Euclidean action. In the limit N > ov, 
the o(€?) terms are expected to be negligible. Additionally, in the limit that the 
path becomes differentiable, which is actually almost never the case, 


> G e=) +V (zi )> > fa (vet jE N+ 520 e). (2.40) 


where 7 € [0,8] parametrizes the path such that z(0) = xz and z(8) = y. Hence 
the matrix element Equation (2.32) can be formally written as the integral over 
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classical paths, 


(yle UP) py) =N f D2(r) e- 4 SE ir (4 e0) Eel) 


=N | De(r)e- SEROL (2.41) 


where Sz[z(7)] is the classical Euclidean action for each path z(T), which starts 
at x and ends at y. Dz(r) is the formal integration measure over the space of 
all such paths and M is a formally infinite or ill-defined normalization constant, 


the limit of — yz as N > o. 
(2rħe) 2 
There exists a celebrated measure defined on the space of paths, the so-called 


Wiener measure |121], which was defined in the rigorous study of Brownian 
motion. One can use it to define the Euclidean path integral rigorously and 
unambiguously, certainly for quantum mechanics, but also in many instances for 
quantum field theory. We are not interested in these mathematical details, and 
we will use and manipulate the path integral as if it were an ordinary integral. 
We will have to define what we mean by this measure and normalization more 
carefully, later. The measure actually only makes sense, in any rigorous way, 
for the discretized version Equation (2.39) including the limit N — co; however, 
strictly speaking the path integral for smooth paths, Equation (2.41), is just a 
formal expression. 
We will record here the corresponding formula in Minkowski time: 


(yle TARP a) =N 1 Dz(t)e Jo #EH)P-V EW) 


=N / Dz(t)en Sml], (2.42) 


This formula can be proved formally by following each of the steps that we have 
done for the case of the Euclidean path integral; we leave the details to the 
reader. However, the Gaussian integral that we encountered at Equation (2.38) 
becomes 


(24 -24-1) 
dp; om (nH) 
. 2.4 
Ne | oa i (2.43) 


This expression is ill-defined, but it only contributes to an irrelevant normaliza- 
tion constant. We can make it well-defined by adding a small negative imaginary 
part to the Hamiltonian, which then yields 


ms RG 
N.= (V=) r (2.44) 
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Adding the imaginary part to the Hamiltonian is known in other words as 
the “i-epsilon” prescription (note this “epsilon” has nothing to do with the e 
appearing in our formulas above). Such a deformation can be effected in the case 
at hand by changing the p; — (1—7€)p; in the exponent of Equation (2.43) with 
infinitesimal € (instead of using the usual “epsilon”). It is tantamount to defining 
the Minkowski path integral by starting with the Euclidean path integral and 
continuing this back to Minkowski space. 

For the remainder of this book, we will be interested in the path-integral 
representation, Equation (2.39), of the matrix element Equation (2.32). We will 
apply methods that are standard for ordinary integrals to obtain approximations 
for the matrix element. We will use the saddle point method for evaluation of the 
path integral. This involves finding the critical points of the Euclidean action and 
then expanding about the critical point in a (functional) Taylor expansion. The 
value of the action at the critical point is a constant as far as the integration is 
concerned and just comes out of the integral. This term alone already gives much 
novel information about the matrix element. It is usually non-perturbative in the 
coupling constant. The first variation of the action vanishes by definition at the 
critical point. The first non-trivial term, the second-order term in the Taylor 
expansion, yields a Gaussian path integral. The remaining higher-order terms 
in the Taylor expansion give perturbative corrections to the Gaussian integral. 
The Gaussian integral can sometimes be done explicitly, although this too can 
be prohibitively complicated. 

We will work with the formal path integral, Equation (2.41), rather than the 
exact discretized version, Equation (2.39). First of all it is much easier to find 
the critical points of the classical Euclidean action rather than its discretized 
analogue. Secondly, in the limit that N — oœ, the critical points for the discrete 
action should approach those of the classical action. The actual path integral to 
be done always remains defined by the discretized version. The critical point of 
the classical action is only to be used as a centre point about which to perform 
the path integral Equation (2.39) in the Gaussian approximation and in further 
perturbative expansion. As stressed by Coleman [31], the set of smooth paths is a 
negligible fraction of the set of all paths. However, this does not dissuade us from 
using a particular smooth path, that which is a solution of the classical equations 
of motion, as a centre point about which to perform the functional integration in a 
Gaussian approximation. The Gaussian path integral corresponds to integration 
over all paths, especially including those which are arbitrarily non-smooth, but 
which are centred on the particular smooth path corresponding to the solution of 
the equations of motion, with a quadratic approximation to the action (or what 
is called Gaussian since it leads to an (infinite) product of Gaussian integrals). 
It actually receives most of its contribution from extremely non-smooth paths. 
However, the Gaussian path integral can be evaluated in some cases exactly, 
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and in other cases in a perturbative approximation. In this way the exact 
definition of the formal path integral, Equation (2.41), is not absolutely essential 
for our further considerations. We will, however, continue to frame our analysis 
in terms of it, content with the understanding that underlying it a more rigorous 


expression always exists. 


3 
The Symmetric Double Well 


In this chapter we will consider in detail a simple quantum mechanical system 
where “instantons”, critical points of the classical Euclidean action, can be used 
to uncover non-perturbative information about the energy levels and matrix 
elements. We will also explicitly see the use of the particular matrix element 
(2.27) that we consider. The model we will consider has the classical Euclidean 
action 


B 
2 1. 
Sele] = fË dr (ZEP +V G0). (3.1) 
TR 
We choose for convenience the domain [-, £] and we will choose the potential 


explicitly later. We will always have in mind that 8 — oo, thus if 8 is considered 
finite, it is to be treated as arbitrarily large. The potential, for now, is simply 
required to be a symmetric double well potential, adjusted so that the energy is 


equal to zero at the bottom of each well, located at +a, as depicted in Figure 3.1. 


3.1 Classical Critical Points 


The critical points of the action, Equation (3.1), are achieved at solutions of the 
equations of motion 


ôSg|z(T)]| 


AE =—z(r) +V (z(r))=0. (3.2) 


2(r!)=a(7") 


We assume Z(7) satisfies Equation (3.2). Then writing z(7) = Z(r) +6z(rT) and 
expanding in a Taylor series, we find 


= a(r 1 a del! 6? Srle(T)] ” 
Pare One +5 fe : d2z(7')d2z(7"") 2(T)=2(T) 
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V(z) 


a a 


Figure 3.1. A symmetric double well potential with minima at +a 


where we note that the first-order variation is absent as the equations of motion, 
Equation (3.2), are satisfied. The second-order variation is given by 


Sele) (umn ag a" 
OEE ean ae TV EO) HH. e 
Then we have 
2 2 
Selz(T)] = Selz(T)] + Jp dréz(T) Ci +V"ar))) 6z(r) +--+. (3.5) 


We can expand 6z(r) in terms of the complete orthonormal set of eigenfunctions 
2n(T) of the hermitean operator entering in the second-order term 


-5 + vec) Zn(T) =Anzn(T), n=0,1,2,3,---,00 (3.6) 


dr? 
supplied with the boundary conditions 
Pins HO 
Zn( z) = a(z) =0. (3.7) 


Completeness implies 
Sy en(T)zn(T') =06(t-7’) (3.8) 
while orthonormality gives 
2 
J p dt 2n(T)Zm(T) = Onm- (3.9) 
~ 2 
Thus expanding 
62(T) = 5 Cn2Zn(T) (3.10) 
n=0 


we find 
Sele(r)] =SelZ()]+ 5 D> An +04) (3.11) 
n=0 
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using the orthonormality Equation (3.9) of the z,(7)’s. 


3.2 Analysis of the Euclidean Path Integral 


The original matrix element that we wish to study, Equation (2.32), is given by 


Bj, 


(yle RPP) |x) = (2(B/2) |e BMX -P)|3(8/2)), (3.12) 


as we have not yet picked the boundary conditions on 7(+6/2). Then we get 


tie aen = oo 2 3 
(z(B/2)|e7 FRP) 3B /2)) ee etn 
= g- SEBOL yy N f Dar) ek Tro banca +o(ch)) | 
(3.13) 


Now we will begin to define the path integration measure as 


= den 
T)—> ; 3.14 
> l (3.14) 


integrating over all possible values of the cn’s as a reasonable way of integrating 


over all paths. The factor of V2zh in the denominator is purely a convention 
and is done for convenience as we shall see; any difference in the normalization 
obtained this way can be absorbed into the still undefined normalization 
constant, M. Then the expression for the matrix element in Equation (3.13) 


becomes 


-$ -SBE yT f A e hE Pantoc 
(2(8/2)|e“ BAP) z(—6/2) = NTIS ey i a a 
(3.15) 


Scaling cn = é,Wh gives for the right-hand side 


-Sako den , = ($Anzto(h) 
ee a II We y ae 
Splz(7)] 


SEBON r 1 
=e n wÅ (a+). (3.16) 


n=0 


This infinite product of eigenvalues for the operators which arise typically does 
not converge. We will address and resolve this difficulty later and, assuming that 
it is so done, we formally write “det” for the product of all the eigenvalues of the 
operator. This yields the formula 


(E(B RRP eya) =e FH" waei- VE ao) 
(3.17) 
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Thus we see the matrix element has a non-perturbative contribution in h 


coming from the exponential of the value of the classical action at the critical 
Splz(7)] 
point divided by h, e~ Ei , multiplying the yet undefined normalization and 


determinant and an expression which has a perturbative expansion in positive 


powers of À . 


3.3 Tunnelling Amplitudes and the Instanton 


To proceed further we have to be more specific. We shall consider the following 
matrix elements 


(taje ERP Ja) = (Fale TRA] — a). (3.18) 


The equality of these matrix elements is easily obtained here by using the 
assumed parity reflection symmetry of the Hamiltonian, 


-A(R Ê 
= (~ae Oy), (3.19) 
where % is the parity operator which satisfies R? = 1, Plz) = | — x) and 
(P, A(X, P)] =0. 
The equation which Z(r) satisfies is 
— z(r)+V'(2(r)) =0, (3.20) 


which is exactly the equation of motion for a particle in real time moving in the 
reversed potential —V (z), as in Figure 3.2. Because of the matrix elements that 
we are interested in, Equation (3.18), the corresponding classical solutions are 


those which start at and return to either +a or those that interpolate between 


-v(a 


Figure 3.2. Inverted double well potential for Z(7) 
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+a and Fa, and each in time 8. The trivial solutions 


Z(T) =r (3.21) 


satisfy the first condition while the second condition can be obtained by 
integrating Equation (3.20). Straightforwardly, 


Z(T)Z(T) = V'(Z(T))Z(T), (3.22) 
which implies 
2(rT) = V2V(z(T)) +2, (3.23) 


where c is an integration constant. Integrating one more time and choosing the 
solution that interpolates from —a to a, we get 


Z(T m t 
-a V2V(Z) + -£ 2 
and c is determined by 
a dz 
4 B. (3.25) 


-a VVO 
We note that this last Equation (3.25) does not depend on the details of 
the solution, but only on the fact that it must interpolate from —a to a. 
Obviously from Equation (3.23), c is the initial velocity. The initial velocity 
is not arbitrary, the solution must interpolate from —a to a in Euclidean time 
B, and Equation (3.25) implicitly gives c as a function of 3. There is no solution 


that starts with vanishing initial velocity but interpolates between +a in finite 
time 8; vanishing initial velocity requires infinite time. 

As 8 — œ, the only way for the integral in Equation (3.25) to diverge to give 
an infinite or very large 8 is for the denominator to vanish. This only occurs 
for V(Z) + 0 and for c> 0. V(Z) +0 occurs as Z + +a, which is near the start 


and end of the trajectory. Also, physically, if the particle is to interpolate from 
—a to a in a longer and longer time, 8, then it must start out at —a with a 
smaller and smaller initial velocity, c. For larger and larger 8, c must vanish in 
an appropriate fashion. Heuristically, for small c, the solution spends most of its 


time near Z = +a and interpolates from one to the other relatively quickly. Then 


the major contribution to the integral comes from the region around Z = +a. 
Since the integral diverges logarithmically when c = 0, for a typical potential 


V (which must vanish quadratically at 7z = +a as V has a double zero at +a), 
the integral must behave as —Inc, i.e. 6 ~ —Inc which is equivalent to c~ e~°, 
which means that c must vanish exponentially with large 8. For sufficiently large 
B we may neglect c altogether. 
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z(t] 


pa 


—-at 


Figure 3.3. Interpolating kink instanton for the symmetric double well 


The action for the constant solutions, Equation (3.21), is evidently zero. For 
the interpolating solution implicitly determined by Equation (3.24), it is 


a 
3 


(G20 +v) 


a 
3 


| 
= 


wo NND no 


(#(r) -e) 
= (/ VETE ar) — Be? 
E u dyta) ~ Be. (3.26) 


For large 8, we neglect c in the integral for S,[Z(rT)] = So, and the term —8c?, 
yielding 


vo NI 


Q 


So = f dz,/2V (2). (3.27) 


—a 

This is exactly the action corresponding to the classical solution for 6 = co 
depicted in Figure 3.3. Such Euclidean time classical solutions are called 
‘instantons”. 


For large 7 the approximate equation satisfied by Z(r) is 


dz 
= =w(a—2z), (3.28) 
obtained by expanding Equation (3.23) as Z — a~ from below and where w? is the 
second derivative of the potential at z= a. There is a corresponding, equivalent 
analysis for rT — —oo. These have the solution 


lz(r)| =a—Ce"l, (3.29) 


Thus the instanton is exponentially close to +a for |r| > 4 . Its size is 4 which 


is of order 1, compared with h and 8. For large |r| , the solution is essentially 


equal to ta, which is just the trivial solution. The solution is “on” only for an 
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“instant”, the relatively short time compared with 8, during which it interpolates 
between —a and +a. Hence the name instanton. Reversing the time direction 
gives another solution which starts at +a and interpolates to —a, aptly called 
an anti-instanton. It clearly has the same action as an instanton. 


3.4 The Instanton Contribution to the Path Integral 
3.4.1 Translational Invariance Zero Mode 


As we have seen, for very large 8, the instanton corresponding to infinite 6 
is an arbitrarily close and perfectly good approximation to the true instanton. 
Evidently with the infinite 6 instanton, we may choose the time arbitrarily at 
which the solution crosses over from —a to +a. The solution of 


z(T) dz 

I VAZO =T— To (3.30) 
corresponds to an instanton which crosses over around T = To . Thus the position 
of the instanton 7) gives a one-parameter family of solutions, each with the 
same classical action. The point is that for large enough £, there exists a one- 
parameter family of approximate critical points with action arbitrarily close to 
So. The contribution to the path integral from the vicinity of these approximate 
critical points will be of a slightly modified form, since the first variation of 
the action about the approximate critical point does not quite vanish. Thus the 
contribution will be of the form, the exponential of the negative action at the 
approximate critical point, multiplied by a Gaussian integral with a linear shift, 
the shift coming from the non-vanishing first variation of the action. The shift 
will be proportional to some arbitrarily small function f(G) as 8 —> oo . The 
higher-order terms give perturbative corrections in h, as in Equation (3.16), and 
can be dropped. Then, considering a typical Gaussian integral with a small linear 
shift, as arises in the integration about an approximate critical point, we have 


35 2 
/ dx e74 (a?z?+2f(6)2) _ one ay (3.31) 
-oo V 27 x 


We see that to be able to neglect the effects of the shift, f(8) must be so small 
that Pts) <1, given that a, being independent of f and 8, is of order 1. 
Typically, f(8) is exponentially small in 8, just as earlier c was found to be. 
f (8) needs to be determined and depends of the details of the dynamics. In any 
case, 8 must be greater than a certain value determined by the value of A. This 
is, however, no strong constraint other than imposing that we must consider 
the limit that 6 is arbitrarily large while all other constants (especially ñ) are 
held fixed. Hence, assuming £ is sufficiently large, we can neglect the effect of the 
linear shift and we must include the contribution from these approximate critical 
points. To do so, we simply integrate over the position of the instanton and 
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perform the Gaussian integral over directions in path space which are orthogonal 
to the direction corresponding to translations of the instanton. 

The easiest way to perform such a constrained Gaussian integral is to use the 
following observations. In the infinite 6 limit, the translated instantons become 
exact critical points and correspondingly the fluctuation directions about a given 
instanton contain a flat direction. This means that the action does not change 
to second order for variations along this direction. Precisely, this means that the 
eigenfrequencies, Àn, contain a zero mode, Ào = 0. We can explicitly construct 
this zero mode since 


(--Sev ar n))) oat) L (Arn) + V(r) =0, 
(3.32) 


the second term vanishing by the equation of motion, Equation (3.20), which is 


clearly also valid for Z(7 — 71). This mode occurs because of the time translation 
invariance when £ is infinite. The corresponding normalized zero mode is 


ed aig per ed (3.33) 
Clearly 
L o (z ar- n) a zS dr (S2r-n) +Vv(ar—n) =I 


(3.34) 
using the equation of motion, Equation (3.23), with c=0 (infinite 8). 
Integration in the path integral, Equation (3.15), over the coefficient of this 
mode yields a divergence as the frequency is zero 
dco 1 dco 
Jaa = A =o 


However, integrating over the position of the instanton is equivalent to 


(3.35) 


integrating over co. 7 is called a collective coordinate of the instanton 
corresponding to its position in Euclidean time. Indeed, if (7 —71) is an instanton 
at position 7,, the change in the path obtained by infinitesimally changing 7, is 


d 
62(7) = ——Z(T — 11)d7 = V So20(T). (3.36) 
dt, 
The change induced by varying co is, however, 
62(T) = zo(T)dco. (3.37) 
Thus 
dco So 


=4/ — dr 3.38 
V 2h 2th ( ) 
and when integrating over the position Tı we should multiply by the normalizing 
So 

2rħ ` 
equivalent infinity obtained when integrating over co. 


factor Clearly for infinite 6 the integral over 7, diverges, reflecting the 
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This divergence is not disturbing, since for a positive definite Hamiltonian the 
infinite @ limit of the matrix element, Equation (2.32), is strictly zero, and for 
large 6 it is an expression which vanishes exponentially. Thus in the large (6 
limit, the Gaussian integrals in the directions orthogonal to the flat direction 
must combine to give an expression which indeed vanishes exponentially with £, 
as we will see. For the time being, for finite 8, the integration over the position 
then gives a factor that is linear in 6 


So 
sxe: (3.39) 


Thus, so far the path integral has yielded 


yo 1 2 are 
(ale FAP) —a) = Er (5) s BN (aer -i +v"(atn))) i , (3.40) 


where det’ means the “determinant” excluding the zero eigenvalue. We will leave 
the evaluation of the determinant for a little later when will show that 


NR 
Nile 


d? d? K 
N | det’ |-— +V” (z(T)) = KN | det |- — +w? ; (3.41) 
dr? dr? 
where w was defined at Equation (3.28), and we will evaluate K, which is, most 
importantly, independent of fA and £. 


3.4.2 Multi-instanton Contribution 


To proceed further, we must realize that there are also other approximate critical 
points which give significant contributions to the path integral. These correspond 
to classical configurations which have, for example, an instanton at 7,, an anti- 
instanton at Tə and again an instanton at 73. If 7; are well separated within the 
interval 8, these configurations are approximately critical, with an error of the 
same order as for the approximate critical points previously considered. More 
generally we can have a string of n pairs of an instanton followed by an anti- 
instanton, plus a final instanton completing the interpolation from —a to a. We 
denote such a configuration as Z2,+41(7). The positions are arbitrary except that 
the order of the instantons and the anti-instantons must be preserved and they 
must be well separated. The action for 2n +1 such objects is just (2n + 1)So to 
the same degree of accuracy. 

One would, at first sight, conclude that this contribution, including the 
Gaussian integral about these approximate critical points, is exponentially 
suppressed relative to the contribution from the single instanton sector. Indeed, 
we would find that the contribution of the 2n + 1-instantons and anti-instantons 
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to the matrix element!, 


le 


(ale TAP) — a)2n+1 = SON (ae E +V’ Emnat] (3.42) 
is suppressed by e~ ~m relative to the one instanton contribution. This is true; 
however, we must analyse the effects of zero modes. 

For 2n + 1 instantons and anti-instantons there are 2n + 1 zero modes 
corresponding to the independent translation of each object. This is actually 
only true for infinitely separated objects with 8 infinite; however, for 8 large, it 
is an arbitrarily good approximation. Thus there exist 2n +1 zero frequencies 
in the determinant which should not be included in the path integration and, 
correspondingly, we should integrate over the positions of the 2n +1 instantons 
and anti-instantons. This integration is constrained by the condition that their 
order is preserved. Hence we get the factor 


£ pent 
faf inf dT3°- fe rn f’ dTən+1 = (Qn n+ D! (3.43) 


Furthermore, from exactly the same analysis as the integration over the position 


of the single instanton, the integration is normalized correctly only when each 


1 
factor is multiplied by (2 So z) ° Thus we find 


E py (we -Beremnta) bu 


where det’ again means the determinant with the 2n +1 zero modes removed. 
We will show later that 


Nir 


N (ae | 5 +V” (Zany (T J)” = K” IN (ae | 5 +w a (3.45) 


for the same K as in the case of one instanton, as in Equation (3.41). 
s 
Now even if e7 ™ is very small, our whole analysis is done at fixed A with 


B — co; the relevant parameter, as can be seen from Equation (3.44), is 


aly 
$< (5 ) PKG, (3.46) 
T 


which is arbitrarily large in this limit. Thus it seems that the contribution from 
the strings of instanton and anti-instanton pairs is proportional to 6?”+! and 


1 Here the subscript 2n + 1 signifies that we are calculating only the contribution to the 
matrix element from 2n + 1 instantons and anti-instantons. 
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seems to get larger and larger. However, the denominator contains (2n + 1)!, 
which must be taken into account. For large enough n, the denominator always 
dominates, 6?”*! < (2n+1)!, and so renders the contribution small. 

We require, however, for the consistency of our approximations that when n 
is large enough so that this is true, the average space per instanton or anti- 


instanton is still large compared to the size of these objects ~ 1/w, which 


B 
> nF? 
is independent of both A and £. This is satisfied as 8 > oo. Hence we require n 
large enough such that 


g2nti 
Qn+D! <1; (3.47) 
however, with 
A (3.48) 
Qnt+17 w 


Taking the logarithm of Equation (3.47) after multiplying by (2n + 1)! yields in 
the Stirling approximation 


(2n+1)Inéd « (2n+1)In(2n +1) — (2n +1). (3.49) 


Neglecting the second term on the right-hand side and combining with 
Equation (3.48) yields 


L 
So \2 _So 
b= sz] € PK |B K2n4+1 Kwp. (3.50) 
27h 
; 3 ; S 3 _ So ‘ 
That such an n can exist simply requires (2) e Tr K &K w. We will evaluate 


K explicitly and find that it does not depend on A or 8. The inequality then is 
clearly satisfied for A — 0, which brings into focus that underneath everything 
we are interested in the semi-classical limit. 

A tiny parenthetical remark is in order: in integrating over the positions of the 
instantons, we should always maintain the constraint that the instantons are well 
separated. Thus we should not integrate the position of one instanton exactly 
from that of the preceding one to that of the succeeding one, but we should 
leave a gap of the order of + which is the size of the instanton. Such a correction 


n-1 


corresponds to a contribution which behaves to leading order as ifm which 


is negligible in comparison to a if 4 KB. 

When the density of instantons and anti-instantons becomes large, all of 
our approximations break down, and such configurations are no longer even 
approximately critical. Thus we do not expect any significant contribution to 
the path integral from the regions of the space of paths which include these 
configurations. Hence we should actually truncate the series in the number of 
instantons for some large enough n; however, this is not necessary. We will 
always assume that we work in the limit that 6 should be sufficiently large and 
h sufficiently small so that the contribution from the terms in the series with 
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Fa) 


Figure 3.4. A simple function analogous to the action 


n greater than some N is already negligible, while there is still a lot of room 
per instanton, i.e. G/N is still large. This should still correspond to a dilute 
“gas” of instantons and anti-instantons. Then the remaining terms in the series 
can be maintained, although they do not represent the contribution from any 
part of path space. It is simply easier to sum the series to infinity, knowing that 
the contribution added in from n greater than some N makes only a negligible 
change. The sum to infinity is straightforward. We find 


a 2 = 3 
(ale RA?) -a) > (x (ae -4 +4?)) ) won (35) etke), 


3.4.3 Two-dimensional Integral Paradigm 


A simple two-dimensional, ordinary integral which serves as a paradigm 
exhibiting many of the features of the path integral just considered is given by 


2 
T= | dedye turer (3.52) 


where y corresponds to the transverse directions and plays no role. f(x) is a 
function of the form depicted in Figure 3.4 and increases sharply in steps of So, 


and the length of each plateau is a In the limit that the steps become sharp, 
the integral can be done exactly and yields 


g- C? Ss np ono 


n! a 


NIK 
a 
aN 
T 
a 
| 
af 
No 


(3.53) 


n=0 
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Obviously this is exactly analogous to the path integral just considered for 6 — co 
and ih 0. The plateaux correspond to the critical points. Clearly we cannot 
consider just the lowest critical point since the volume associated with the higher 
critical points is sufficiently large that their contribution does not damp out until 
n becomes large enough. In terms of physically intuitive arguments, the volume 
is like the entropy factor associated with n instantons, ep while the exponential, 


gee, is like the Boltzmann factor. In statistical mechanics, even though the 
Boltzmann factor is much smaller for higher energy levels, their contribution to 
the partition function can be significant due to a large enough entropy. We can 
further model the aspect of approximate critical points by giving the plateaux 
in Figure 3.4 a very small slope. Clearly the integral is only negligibly modified 
if the slope is taken to be exponentially small in £. 


3.5 Evaluation of the Determinant 


Finally, we are left with the evaluation of the determinant. We wish to show for 
the case of 2n + 1 instantons and anti-instantons 


ak 1 

Nn (act |“ ave *) axa (wv T E 

Eene ©) =a (wae [eo 
(3.54) 
and to evaluate K. Physically this means that the effect of each instanton 
and anti-instanton is simply to multiply the free determinant by a factor of 
red . Intuitively this is very reasonable, and we expect that for well-separated 
instantons their effect would be independent of each other. 

To obtain the det’ we will work in the finite large interval, 8, with boundary 
conditions that the wave function must vanish at the end points. Consider first 
the case of just one instanton. Because of the finite interval, time translation will 
not be an exact symmetry and the operator ae +V"(2(r)) will not have an 
exact zero mode. However, as 8 — oo one mode will approach zero. The det’ is 
then obtained by calculating the full determinant on the finite interval, 6, and 
then dividing out by the smallest eigenvalue. There should be a rigorous theorem 
proving first that the operator in question has a positive definite spectrum on 
the finite interval, 6, for any potential, V(z), of the type considered and the 
corresponding instanton, Z(7), and secondly as 6 — oo, one bound state drops 
to exactly zero; this is reasonable and taken as a hypothesis. Thus we will study 
the full determinant on the interval 8 which has the path-integral representation 


N («ct -4 +V"(ar-n))| ) a 


-if B dar} (2?(r)4+V""(2(r—11))27(7)) 
=N [Dz (3.55) 
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Figure 3.5. The behaviour of V” (z) between +a 


with the boundary conditions that z(4) = z(—£) = 0 in the path integral. The 
path integral on the right-hand side is performed in exactly the same manner as 
in Equation (3.15). This determinant actually corresponds to the matrix element 
of the Euclidean time evolution operator with a time-dependent Hamiltonian, 


B 
12 a(t p24 V” 2-71) $2 
e=or(. ei as Cee *)\ie=0, (3.56) 


where 7 denotes the operation of Euclidean time ordering. This time ordering is 
effectively described by the product representation of Equation (2.33), where the 
appropriate Hamiltonian is entered into each Euclidean time slice. This can be 
shown to give the path integral, Equation (3.55), adapting with minimal changes 
the demonstration in Chapter 2. We leave it to the reader to confirm the details. 

Consider first the behaviour of V”(z) which controls the Euclidean time- 
dependent frequency in the path integral Equation (3.55). V” (+a) = w? is the 
parabolic curvature at the bottom of each well. In between, at z=0, V” (0) will 


drop to some negative value giving the curvature at the top of the potential hill 
separating the two wells. We will have a function as depicted in Figure 3.5. Thus 
V"(z(7)) will start out at w? at T =—oo, until Z(T) starts to cross over from —a 
to a, where it will trace out the potential well of Figure 3.5, and again it will 
regain the value w? for Z(T) =a at T= 00, corresponding to the function of T as 
in Figure 3.6. Thus the path integral in Equation (3.55) is exactly equal to the 
matrix element or “Euclidean persistence amplitude” that a particle at position 
zero will remain at position zero in Euclidean time (6 in a quadratic potential 
with a time-dependent frequency given by V"(Z(r)) depicted in Figure 3.6. 
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Vv"{r]) 


Figure 3.6. The behaviour of V"(z(7) between T = +00 


We will express the matrix element in terms of a Euclidean time evolution 
operator U (4-8) as 


-4 S? ath (2 (r)4V""(2(r7—-11)) 2? (7)) 
2 


= (z=0|U (2,-2)|z=0) (3.57) 


7 (1 p24 V(r—11)) g2 
ulg- =f a G? oe )), (3.58) 


Now 


, 1 
Enti )U (nth n-i) (n-d.-£), (3.59) 


where on the intervals n +355 es and |-$.n — oo we can replace the full 


evolution operator with the free evolution operator 
Lpt y1 2 d2 2,2 AE 
UW (r,r) =T Ta ad ) =e PMP) (3.60) 


as V"(z(rT) is essentially constant and equal to w? on these intervals. Then 
inserting complete sets of free eigenstates, which are just simple harmonic 
oscillator states |En) for an oscillator of frequency w , we obtain 


En 
u (2-8) =P (Fah) (By (mtd as) Em) 


riie aT (3.61) 
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Now we use the “ground state saturation approximation”, i.e. when 6 is huge 
and the instanton is not near the boundaries, only the ground state contribution 
is important. Using this twice we obtain 
U (8-8) melz) E) (Hol dd (atm) Bo) (Eole (#82) 
=U9 (8744915) |Eo) (Eol U? (ntria) |Bo) (Eou? (11-8) x 
x (Eo|U (71+ 9571-25) |Eo) 
(Eo|U? (71+ 245,71 - ats) |Eo) 
x Sou? (2.71425) lEn) (En|U9 (nthi) |Em) (Em|U° (1-4-2) x 
(Eou (atini) |B) 
(BO (isda) [Eo) 
uy? (2, 8) (Eo|U (mı + 3w?) T1 = x) | Eo) 
(Eo|U? (n+ .71- at) |Eo) 
=U (£,-£) «, (3.62) 


where « is the ratio of the two amplitudes over the short time period during 
which V’(Z(r) is non-trivially time-dependent. « is surely independent of the 
position 7, of the instanton. The full evolution operator in fact simply does not 
depend on the position, nor does the denominator. Indeed, 


1 
TI + OG 2 z 
RA ae art (=? d +V"(e(r-n))2") 
U (n+ 71-25) =T | e ea 


1 2 
-4 p20, ar' (1? Se 4v"(a(r!))2?) 
=Tle ~% Z 


; (3.63) 


since the integration variable is a dummy, thus exhibiting manifest 7, 
independence. 

Clearly for n well-separated instantons the result applies also, we simply 
apply an appropriately adapted version of the same arguments. We convert 
the determinant into a persistence amplitude for the related quadratic quantum 
mechanical process, which we then further break up into free evolution in the gaps 
between the instantons and full evolution during the instanton, use the ground 
state saturation approximation, giving the result, to leading approximation 


NIe 
Nie 


N (aot [-& + v"zansatry)}) 


dr? 


=N (ac -4 +arl) Rept, (3.64) 
dr? 

The relationship of « to the K fixed by Equation (3.41) is obtained by dividing 

out by the lowest energy eigenvalue, call it Ao. We will show that this eigenvalue 

is exponentially small for large 6. For 2n + 1 instantons there are 2n +1 such 

eigenvalues which are all equal, in first approximation, and we must remove them 
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all giving 
—1 
; 2 n a det | = + V” (Zan41(7))| ? 
N (a E- +V (anau(r)) ) =N Aer 
a = 1\ 2n+1 
=N («ce | 7 +4?)) («a5 ) (3.65) 
Hence 
1 
K =kàĝ. (3.66) 


It only remains to calculate two things, the free determinant and the correction 
factor K. 


3.5.1 Calculation of the Free Determinant 


To calculate the free determinant, we will use the method of Affleck and Coleman 
[31, 114, 36]. Consider the more general case 


2 


det |- 5 $ wo)| ; (3.67) 


where the operator acts on the space of functions which vanish at +8. Formally 
we want to compute the infinite product of the eigenvalues of the eigenvalue 
problem 

b 


(AWO) aA) a (43) =o. (3.68) 


The eigenvalues generally increase unboundedly, hence the infinite product is 
actually ill-defined. Consider, nevertheless, an ancillary problem 


=1. (8.69 
eee) 


(wo) palT) =ApalT), Wr (-5)=0 Ll) 


There exists, in general, a solution for each A; the second boundary condition 
can always be satisfied by adjusting the normalization. On the other hand, the 


wr (5) =0 (3.70) 


has solutions exactly at the eigenvalues À = Ap. Affleck and Coleman [31, 114, 36] 


equation in A 


propose to define the ratio of the determinant for two different potentials as 


+w) -a| AG) 
+w) -a (E) 


als (3.71) 


a a 
3 +8 


det |- 
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The left-hand side is defined as the infinite product 
rT nA) 
= 3.72 


where the potentials and the labelling of the eigenvalues are assumed to be such 
that as the eigenvalues become large, they approach each other sufficiently fast, 


lim (Ai, — A2) =0 (3.73) 


so that the infinite product in Equation (3.72) does conceivably converge. To 
1 


1, and poles, À = \2, of 
the left-hand side are at the same place as those of the right-hand side, as evinced 


by the solutions of Equation (3.70). Thus the ratio of the two sides 


prove Equation (3.71) we observe that the zeros, À = À 


œo (AL=A 
Hra Gaa 


=g(à 3.74 
OAO g(A) (3.74) 
defines an analytic function g(A) without zeros or poles. Now as |A| — œœ in all 
directions except the real axis, the numerator in Equation (3.74) is equal to 1. 
For the denominator, as A — oo the potentials W; and W2 become negligible 
perturbations compared to the term on the right-hand side of Equation (3.69), 
which we can consider as a potential —A. Neglecting the potentials, clearly 
Wy (5) and Y? (5) approach each other, and hence the denominator also 
approaches 1 in the same limit. Therefore, g(\) defines an everywhere-analytic 
function of À which approaches the constant 1 at infinity, and now in all directions 
including the real axis, as it does so infinitesimally close to the real axis. By a 
theorem of complex analysis, a meromorphic function that approaches 1 in all 
directions at infinity must be equal to 1 everywhere 


g(r) =1 (3.75) 


establishing Equation (3.71). Reorganizing the terms in Equation (3.71), formally 
we obtain 


7 B 7 ® ; (3.76) 
A\ 2 A\ 2 
where both sides are constants independent of the potentials W;. 
We now finally choose M by defining 
det |-% + w(r)| 
= 2nhN? (3.77) 


vo(5) 
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and we will show that this choice is appropriate. Then 


Nie 


Wabi -4 +u? = (anh (2))72, (3.78) 


dr? 


where ~§(7) is the solution of Equation (3.69) for the free theory. It is easy to 
see that this solution is given by 


vo(T) = L sinh (7 + 3 (3.79) 


giving 


1 
=1 d? 2 eeb — e~¥B\\ a WwVe ase 


We can compare this result with the direct calculation of the Euclidean 


z=0) 
0 


Se pO EN MieSO, Gia} 


persistence amplitude of the free harmonic oscillator. We find 


B{_n2 d 11,22 
e BX 2 arta” 


ai d? 


where |) is the ground state. Clearly the normalized wave function is 


1 
W\F _w2 
(x|Eo) = (=) eh (3.82) 
while A 
Eo = zw (3.83) 
giving 
1 
wW\4 
(x = 0| Eo) = (=) (3.84) 
Hence Equation (3.81) yields 
if ae 3 wN? wb 
Ndet™? -atu = (=) e 2 (3.85) 


in agreement with Equation (3.80), and confirming the definition of the 
normalization M chosen in Equation (3.77). 


3.5.2 Evaluation of K 


Finally we must evaluate the factor K. K is given by the ratio 


1 det’ -6 PV = n1)| 
pF = (3.86) 
det | = +w?] 
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from Equations (3.64) and (3.66) for n =0. Thus 


8 
z= Ea (3.87) 


where ào is the smallest eigenvalue in the presence of an instanton. To calculate 


wo (5) and ào approximately we describe again the procedure given in Coleman 


[31]. First we need to solve 
(8? + V"(2(r))) do(r) =0 (3.88) 


with the boundary conditions %9(—(/2) =0 and 0,7~o(—6/2) = 1. We already 
know one solution of Equation (3.88), albeit one that does not satisfy the 
boundary conditions: the zero mode of the operator in Equation (3.30) due to 
time translation invariance, we will call it here x1(r): 


1 dz 
ry(T) = Te (3.89) 


a(t) + Ae™!ITl as r — too. A is determined by the equation of motion, 


Equation (3.30), which integrated once corresponds to 
žz(T)=vV2V(z(T)). (3.90) 


Once we have A we can compute Y(Ê) and Xo. 
We know that there must exist a second independent solution of the differential 
Equation (3.88), yi(7) which we normalize so that the Wronskian 


r1- —y, — = 2A’. (3.91) 


We remind the reader that the Wronskian between two linearly independent 
solutions of a linear second-order differential equation is non-zero, and with no 


first derivative term, as in Equation (3.88), is a constant. Then as tT + +00 we 
have 


j (T)+wy (T) = 2Awe®"| (3.92) 


using the known behaviour of x1(7). The general solution of Equation (3.92) is 
any particular solution plus an arbitrary factor times the homogeneous solution 


yi(T) = tAe#!7! 4+ BeF, (3.93) 


where B is an arbitrary constant. Evidently the homogenous solution is a 


negligible perturbation on the particular solution, and yı(T) > +Ae“!7! as 


T — +00. Then we construct po(T) as 


wo(T) = a (elelr) + en Ala (r)) ; (3.94) 


8.5 Evaluation of the Determinant 33 


verifying 
wbo(—B/2) = =~ (Pa (8/2) +y (—B/2)) 
2wA 
ca (eve? 2 Ae “8/2 +e wB/ (—A)e#/?) =0 (3.95) 
2wA 
while 
dwo(—8/2) 1 g/2 a = d z 
w Z Ae®T wB/2 ” (_ A wT =n 
dr z£ 2wA T z£ F A Je z£ 
(3.96) 
Then it is also easy to see 
1 
o(8/2) = = (3.97) 


which we will need later. 

We also need to calculate the smallest eigenvalue Ao of Equation (3.69). To 
do this we convert the differential equation to an integral equation using the 
corresponding Green function. The Green function satisfying the appropriate 
boundary conditions is constructed from 2 1(r) and yı(T) using standard 
techniques and is given by 


0 r<7! we 


Orr) = oe tanl) T> 


Then the differential equation is converted to an integral equation 


palT) = Yo(T) + a dr' (xı (T yı (T)— yı (Telr ypa (T) 
aa J- dr (zı(T y(r) — yr(7’)axr(r))vo(7"). (3.99) 


= polr) + 


This wave function vanishes for the lowest eigenvalue Ao (and actually for all 
eigenvalues àn) at T = 6/2 by Equation (3.70), thus 


o(6/2) + 533 a dr' (æ (T'u (6/2) — a (7")(8/2))do(1!) 


E 
z | fo dr" (x1(r')y1 (8/2) — yr (7')21(8/2)) 


1 
my (elelr) +67 P/2y,(7')) 


2wA 
1 À 2 7 A ,wB/2 1\ ,—wB/2 
a one |e dr'(a1(7')e —yi(r'Je ) 
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1 AÀ 
pp fg dr'x2 1 wB- ebo. 
w wats fp er Ww ZAW j 
(3.100) 


In the penultimate equation, we can drop the second term because it behaves at 
most as ~ ĝ, since yı (T) ~ ef/2 at the boundaries of the integration domain at 


+8/2, while the first term behaves as ~ e° since f x?(r)dr is normalized to 1. 
This gives quite simply 
Ao S 4A2e-#F, (3.101) 


Then finally we get 


(RED \ e/a oy 
es e a ~ (1/w4 A2e-#8) oe (3.102) 


Thus we have found that the matrix element 
SANG 1 B 
—Bh(X,P)/h) a) = sj 20 —S0/ħ9 42 (=) 2 —w8 
(ale |—a) =sinh (5) e 2A s) TR e “2. (3.103) 


To see explicitly see how to compute A, we can consider a convenient, 


completely integrable example, V(x) = (y?/2)(x? — a?)?, which has w? = 
P y g ple, y , 


V" (+a) = (2ya)?. Then Equation (3.30) yields 


2Z(T—-71) dz 
ST 3.104 
i sea ne 
with exact solution 
2(T) =atanh(ay(t —71)). (3.105) 
Thus A is determined by 
z(r) ary 
r1(T) = = 5 3.106 
(7) VSo VS cosh? (ay(r — 71)) ( ) 
which behaves as 
4a? 2 
lim zı(T)= Za Temani = E owl = Ae, (3.107) 


TPEECO VSo VSo 


Vo is calculated from Equation (3.27), giving 


į 4 2 
y= f dzy(z? — a”) = —ya? = —wa?. (3.108) 


Hence A = —2%— = ,/ £, for this example. 


o/ (2/3)wa2 w? 
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3.6 Extracting the Lowest Energy Levels 


On the other hand, the matrix element of Equation (3.103) can be evaluated 
by inserting a complete set of energy eigenstates between the operator and the 
position eigenstates on the left-hand side, yielding 


(ale PEPIN — a) = e-PFO/ (al Eo) (Eo] — a) +e P™1/™(a By) (Ea|—a) +, 
(3.109) 
where we have explicitly written only the first two terms as we expect that the two 


classical states, | +a), are reorganized due to tunnelling into the two lowest-lying 
states, |Eo) and |E,). Indeed, comparing Equation (3.103) and Equation (3.109) 
we find 


1 
h So \2 
igen (2) ? 750/2? (3.110) 
while 
1 
h So \2 
B= gota ae) ee none (3.111) 


It should be stressed that our calculation is only valid for the energy 
difference, not for the corrections to the energies directly. Indeed, there are 
ordinary perturbative corrections to the energy levels which are normally far 
greater than the non-perturbative, exponentially suppressed correction that 
we have calculated. However, none of these perturbative corrections can see 
any tunnelling phenomena. Thus our calculation gives the leading term in the 
correction due to tunnelling. Thus, the energy splitting which relies on tunnelling 
is found only through our calculation, and not through perturbative calculations. 

We also find the relations 


1 
(a|Eo)(Eo| — a) = (=)? (3.112) 
in addition to " 
(aE) (E-a) =- (=)? (3.113) 
while a simple adaptation of our analysis yields 
1 
(a|Eo) (Eola) = (=)? (3.114) 
in addition to i 
(a| E1)(E1ļa) = (=) a (3.115) 


These yield (Eo| — a) = (Eo|a) while (£,| — a) = —(£,|a) which are consistent 
with |Eo) being an even function, i.e. |Eo) being an even superposition of the 
position eigenstates |a) and |—a) while |E1) being an odd function and hence 
an odd superposition of these two position eigenstates. 


36 The Symmetric Double Well 


Figure 3.7. A generic periodic potential with minima occurring at na with 
n E€ Z, where a is the distance between neighbouring minima 


3.7 Tunnelling in Periodic Potentials 


We will end this chapter with an application of the method to periodic potentials. 
Periodic potentials are very important in condensed matter physics, as crystal 
lattices are well-approximated by the theory of electrons in a periodic potential 
furnished by the atomic nuclei. The idea is easiest to enunciate in a one- 
dimensional example. Consider a potential of the form given in Figure 3.7. A 
particle in the presence of such a potential with minimal energy will classically, 
certainly, be localized in the bottom of the wells of the potential. If there is no 
tunnelling, there would be an infinite number of degenerate states corresponding 
to the state where the particle is localized in state labelled by integer n € Z. This 
could also be a very large, finite number of minima. However, quantum tunnelling 
will completely change the spectrum. Just as in the case of the double well 
potential, the states will reorganize so that the most symmetric superposition 
will correspond to the true ground state, and various other superpositions will 
give rise to excited states, albeit with excitation energies proportional to the 
tunnelling amplitude. The tunnelling amplitude is expected to be exponentially 
small and non-perturbative in the coupling constant. 

As in the case of the double well potential, the instanton trajectories will 
correspond to solutions of the analogous dynamical problem in the inverted 
potential in Euclidean time (as depicted in Figure 3.8), where the trajectories 
commence at the top of a potential hill, stay there for a long time, then quickly 
fall through the minimum of the inverted potential, and then arrive at the top of 
the adjacent potential hill, and stay there for the remaining positive Euclidean 
time. 

For the simple, real-time Lagrangian 


1 
L= xt —V(2); (3.116) 
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Figure 3.8. The inverted generic periodic potential with maxima occurring at 
na with n€Z 


dx(t) 


where t = —,,~, while the Euclidean Lagrangian is simply 
1 
L= xt +V(2), (3.117) 
where t = do), As V(na)=0 for n € Z, we impose the boundary conditions 


r(T=-—œ)=na but «(7 =0o) = (n+1)a for an instanton and x(t =00) = (n—1)a 
for an anti-instanton and look for solutions of the Euclidean equations of motion 


oo) ~V'(a(r)) =0. (3.118) 


This immediately affords a first integral; multiplying by «(r) and integrating 
gives 

1 

5% (7) -V(a(r)) =0, (3.119) 
where we have fixed the constant with the boundary conditions. This equation 
admits a solution in general, the instanton, but it does depend on the explicit 
details of the potential. However, we can find the action of the corresponding 
instanton, which only depends on an integral of the potential, by first isolating 


i= /2V (€), (3.120) 


and then 
So = I ari? +V(a2) = I dr (Sev) + VV) 
= [7 uyr (3.121) 


a 
Although we may naively want to compute the amplitude for tunnelling 
between neighbouring vacua, it is actually more informative to compute the 
amplitude for a transition from vacuum n to vacuum n +m. Naively we would 


38 The Symmetric Double Well 


approximate this amplitude by summing over any number of pairs of widely 
separated instanton anti-instanton configurations appended by a string of m 
instantons. However, this logic would be faulty. There is no reason to restrict the 
order of the instantons and anti-instantons except that they should tunnel from 
the immediately preceding vacuum to an adjacent vacuum, and finally we should 
arrive at the minimum indexed by n +m. Thus, one can choose the instantons 
or anti-instantons in any order, as long as they start at n and end at n+™m. 
This means that if there are N instantons, which must be greater than m, then 
there must be N — m anti-instantons. Thus there are as many distinct paths of 
instantons as there are ways to order N plus signs and N — m minus signs. This 
gives a degeneracy factor of 
(2N —m)! 
NN- m)! 


Furthermore, when we integrate over the Gaussian fluctuations for each instanton 


(3.122) 


or anti-instanton, we get the usual determinantal factor K for each instanton or 
anti-instanton, but we do encounter one zero mode corresponding to each one’s 
position, which we omit in the determinant. Then we integrate over the positions 
of the instantons and anti-instantons, except that the position of each instanton 
or anti-instanton must occur at the position after the preceding one, as the 
instantons and anti-instantons correspond to specific tunnelling between specific 
vacua. This gives the integral 


B/2 B/2 B/2 g2N—m 
d dTa- dTəN-m = >. 3.123 
n 8/2 af É i ee (2N —m)! 28) 


2N-1 
As usual, the action for any instanton or anti-instanton is the same and equal to 


So. Thus, for N instantons and N — m anti-instantons we get 


= 2N—m 
—BR(X,P)/h) y — -(2N-m)So/hp2N-m (2N—m)! 6B 
Iny= oe o/ħ g NIN- m! ON- m)! 
(3.124) 


This sum is unclear for identifying the underlying spectrum and the contribution 


(n+mle 
N=m 


of each energy eigenstate; however, if we re-write the sum as a double sum 
over N instantons and M anti-instantons with a constraint M = N — m we 


have 
(n+ mle“ BX P)/Ai ny _ (2)? ezton 
wh 
x Sve -(s40NSo) gira BA ON—m,M; (3.125) 
N,M=0 


where w? = V” (na). Now the Kronecker delta can be expressed via its Fourier 
series as 


27 0 i 
ÔN-m,M = a a iON —m—M) (3.126) 
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E(@) E(0) 


0 
Figure 3.9. The energy band as a function of 0 
and so we easily find 
(en AA = _ 
= (=) Je —~Bw/2 5S (Kee uh Nam ci(N-m—M) 
-Arh N!M! 
T N, 3 
= a —Bw/2 pa “Ona 5 (Kae ah et eif(N-M) 
wh o 2r N!M! 
N,M=1 
_ o —Bw/2 m dO -imo ,(K BeSt) | (pe 50/Pe- #7) 
wh o 20 
=(4)"e ~pu/2 i” dO imo, (KBe~*0/"(e¥ +e“) 
wh 0 Qn 
1/2 d i e—50/F cos 
= (=) e—Bw/2 [ dB imo (2K Be $0!" cos) (3.127) 
Th o 2r 


But this expression for the matrix element has a clear interpretation in terms of 
the spectrum. We see that the spectrum has become a continuum, parametrized 
by 0. If we write 


dË -BEO in 4 m|B(6))(E(8)|n) (3.128) 


MEES 27 
(nime n JS 
0 T 


we identify 
E(0) = hw/2—2hKe~%0/" cos0 (3.129) 
and ee 
Ww 1/2 enw 
E(9))(E(9)|n) =(=) —— 
(n+ m|E(0))(E(8)In) = (=), 
which affords the identification 


1/4 e ind 


WEO = (E) ee (3.131) 
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Thus our infinitely degenerate spectrum of discrete classical vacua has turned 
into a continuum of states, what is called a band in condensed matter physics, 
with an energy that varies as cos@, as depicted in Figure 3.9. The states are 
now in a continuum, and hence must be normalized in the sense of a Dirac delta 
function rather than a Kronecker delta. The amplitude of the band 2hKe~%0/" 
contains the tell-tale factor of the exponential of minus the Euclidean action, the 
hallmark of a tunnelling amplitude. 

We will see in future chapters that periodic potentials appear commonly and 
play an important role in various instanton calculations. 


A 
Decay of a Meta-stable State 


In this chapter we will consider the decays of meta-stable states and calculate 
the lifetime for such a state using instanton methods. A meta-stable state arises 
due to the existence of a local minimum of the potential, which is not the global 
minimum. This corresponds to a potential having the form given in Figure 4.1. 
The potential rises steeply to infinity to the left and to the right; after the 
potential barrier, it goes down well below the energy of the meta-stable state, 
either eventually going to constant or it may even rise to plus infinity again in 
order to give an overall stable quantum mechanical problem. However, exactly 
what the potential does to the right is considered not to be important; the 
behaviour of the potential to the right is assumed to have a negligible effect on 
the tunnelling amplitude for a particle initially in the local minimum at z = 0 
escaping to the right. We have drawn the potential, in Figure 4.1, so that it simply 
drops off to the right and we have normalized the potential by adding a constant 
such that the local minimum has V(0) =0 . Physically we are considering a 
potential of the type where a particle is trapped in a local potential well, but 
once the particle tunnels out of the well, it is free. The probability that the 
initial state is regenerated from the decay products is assumed to be negligible. 
This is in contra-distinction to the problem considered in Chapter 3 with two 
symmetric wells. Here the tunnelling-back amplitude was sizeable, corresponding 
to the anti-instanton, and had to be taken into account. 


4.1 Decay Amplitude and Bounce Instantons 


In this chapter, we will attempt to calculate the amplitude 


[z(7)] BE 
rT =e © |(Eo |z =0)? + 


(4.1) 
From this amplitude we expect to be able to identify and calculate the energy 


a oy a s 
<z=0]e REP =0>=N | Dele = 
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V(z) 


Unstable /Vacufim 


Figure 4.1. A potential with a meta-stable state at z = 0 that will decay via 
tunnelling 


Unstable |Vacuum 


Figure 4.2. The flipped potential for instanton Euclidean classical solution 


Eo for the ground state. For a stable state, localized at z = 0, we expect Eo, in 
first approximation, to correspond to the ground-state energy of the harmonic 
oscillator appropriate to the well at z = 0, and |(Ep |z =0)| to be the magnitude 
of the ground-state wave function at z = 0. Now because of tunnelling we imagine 
that Eo gains an imaginary part, Eo > Eo + ir /2. We will directly attempt to 
use the path integral, and calculate it in a Gaussian approximation about an 
appropriate set of critical points, as in Chapter 3. 

The equation of motion corresponds to particle motion in the inverted potential 
—V (z), as depicted in Figure 4.2, with boundary condition that 2(+4) = 0. 
There are two solutions, the trivial one z(r) =0 for all 7, and a non-trivial 


true instanton solution Z(7). Here the particle begins at T = -£ with a small 


positive velocity at z = 0, falls through the potential well and rises again to 
height zero at z = zo, at around 7 = 0, and then bounces back, reversing its 
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zbounce [t ] 


To 


Figure 4.3. The bounce instanton which mediates tunnelling of a meta-stable 
state 


steps and arriving at z = 0 again at T = B . Clearly from symmetry such a 
solution exists if 3 is sufficiently large. We call this instanton, after Coleman, the 
bounce, 7?°°°(7). The action for the bounce essentially comes from the short 
time interval during which the particle is significantly away from z = 0. One 
can easily show that the bounce is exponentially close to zero except for a region 
around 7 =0 of size +, where again w? = V” (0). We call the action for the bounce 
So = Sp[z?e"’(r)] for the case 8 = œo. Due to the time translation invariance 
in the G = œ case, again, there exists a one-parameter family of configurations, 
approximate bounces, which correspond to the bounce occurring at any time 
To €E [-§, £]. The action for these configurations is exponentially close to Sp and 
hence the degeneracy is 3. Furthermore, approximate critical configurations also 
exist corresponding to n bounces occurring at widely separated times with action 
exponentially close to n.So. The degeneracy of these configurations is i as they 
are exactly analogous to identical particles. Thus we expect the matrix element 


to be expressable as 


-1 oo 1 n nS 
(esaj BHXP), 9) =N | det ae ° >. So n e ne x 
7 7 dr? = on n! 
1 
d t’ -i H W (zhounce(r))] 2 
x 
det R +w?] 
1 
4 5 
= (E) P eP eN are, (4.2) 


where 


ra (det! -6 + pra a 


(det |- ue]? 
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Here the prime signifies omitting only the zero mode. We will find that the 
situation is not that simple, and we must also deal with a negative mode. Then 


1 
= W So 2 _ So 


and we look for an imaginary contribution to K. 


we would find 


4.2 Calculating the Determinant 


The situation is actually more complicated than is apparent. K comes from 
the determinant corresponding to integration over the fluctuations around the 
critical bounce 


1 
P -3 den 11 2 1 
ry] 117 =bounce = n hi Zn Ancn = 
(«et | qt’ (z o) / I] Da 2 I] To 
An Z0 An Z0 
(4.5) 


The n’s corresponding to vanishing A,,’s are excluded, which is the meaning of 


the primed determinant. This time, however, the problem is much more serious. 
One of the \,,’s is actually negative. For this An the integration over the cn simply 
does not exist, and hence the determinant, as we wish to calculate it, does not 
exist. It seems our original idea is doomed. But there is a possible solution: 
perhaps we can define the integration by analytic continuation. Indeed, analytic 
continuations of real-valued functions often gain imaginary parts, exactly what 
we desire. This analytic continuation is in fact possible and we will see how we 
can perform it appropriately. 


4.3 Negative Mode 


First we will establish the existence of the negative mode. For 6 = co we have 
an exact zero mode due to time translation invariance 


a? =bounce d x pounce 
(Servi geeen) ) Bremen) 


d d? -bounce I / -bounce 
sa Naa AEF E EEEn (4.6) 


where the second term vanishes as it is the equation of motion. Since 7>°%°°(r) 


has the increasing and then decreasing form given in Figure 4.3, this implies 
ghounee (7) has the form given by Figure 4.4. In contra-distinction to the zero 
mode of Chapter 3, this zero mode has a node, i.e. it has a zero. This is intuitively 
obvious, the velocity of the particle executing the bounce will vanish exactly when 
it reverses direction. The analogous quantum mechanical Hamiltonian 
o eC “fs y” (zeoe (7)) (4 7) 
dr? : 
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dz bounce 

Z 

— T 
(E) 


qzoounce (T) 


Figure 4.4. The derivative of the bounce T 


V” (zbounce [t ] ) 


Figure 4.5. The form of the potential V’(z?°""°*(r)) 


has the potential given by Figure 4.5. One expects the spectrum to consist of 
a finite number of bound states and then a continuum beginning at w?. The 
ground-state wave function must have no nodes. The next bound energy level, if 
it exists, will have one node. We have already found a bound-state wave function 
with energy exactly zero, but it has one node. Thus there exists exactly one 
bound-state level, the nodeless ground state, with negative energy. The Gaussian 
integral in this direction in function space does not exist, and we must only define 
it through analytic continuation. 
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4.4 Defining the Analytic Continuation 
The original idea that the matrix element has an expansion of the form 


_ (E+il) 8 


|lz=0)=e7 Tr (O/E+ il) (E+ iT] 0) +--- (4.8) 


(emer 2 
was ill-conceived. There is no eigenstate of the Hamiltonian corresponding to the 
meta-stable state. The Hamiltonian is a hermitean operator with all eigenvalues 
real, an eigenstate with a complex eigenvalue simply does not exist. We can 
only obtain the imaginary energy of the meta-stable state through analytic 
continuation. We imagine the analytic continuation in a parameter a which starts 
at a=0 with a potential with a stable bound state localized at z = 0, but yields 
our original potential at a = 1. The energy of the bound state will also be an 
analytic function of the parameter a. As long as a true bound state exists around 
z=0, this energy will be a real function of the parameter a. When the parameter 
is continued to yield our original potential where the bound state becomes meta- 
stable, we expect that this energy as an analytic function of the parameter a 
will not remain real and will gain an imaginary part. This imaginary part should 
correspond to the width of the meta-stable state. These general considerations 
correspond to a sequence of potentials, as shown in Figures 4.6, 4.7 and 4.8. 


4.4.1 An Explicit Example 


We will confirm these ideas with an explicit demonstration in a specific solvable 
potential. The example we consider is 


V(a,z)=- (a- 5) zf +w’? (4.9) 


V(a=0, z) 


Figure 4.6. The potential with a stable state at z = 0 for a = 0 
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v( Q critical 2) 


Figure 4.7. The critical potential with a stable state at z = 0 for Qeritical 


Figure 4.8. The potential with a meta-stable state at z = 0 for a = 1 


and the integral 


T(a,w) = i er eae ee). (4.10) 
— 00 

which is analogous to the integral over the direction corresponding to the negative 
mode in the definition of the determinant, when a = 1, as depicted in Figures 
4.9 and 4.10. The integral is defined for œ < ł and, specifically, it is not defined 
for a=1. The integral is actually well-defined for complex a, with the condition 
Refa} < 4. We can define the analytic function Z(a,w) for Re{a} > 4 by analytic 
continuation. In this simple case we have no difficulty whatsoever, for Re{a} < i, 
the integral is known in terms of special functions, 


T(a,w) =; ee Ky (aia): (4.11) 
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V(a=0,2)=.524 wz? 


Stable Vacuum 


Via=.5,2)= wz? 


Figure 4.9. The potential with a stable state at z=0 for a = 0 and a=.5 


Via = .5 +€,2)|= -24 + w? 2? 


Unstable] Vacuum 


Figure 4.10. The potential with a meta-stable state at z = 0 for œa = .5 +€ and 
fora=1 


where s 

K,(z) = ser (J, (iz) + N,(iz)) (4.12) 
is the modified Bessel function of imaginary argument. The expression in 
Equation (4.11) has a well-defined analytic continuation throughout the complex 


a-plane, except on the real a-axis, where starting at a = $, 


there is a 
branch cut. 

But in general, we do not have the luxury of knowing the integral exactly. There 
is, however, a method for performing the analytic continuation more implicitly. 
Happily, this method allows us to extract the information that we actually seek, 


the imaginary part of the energy. We apply the method to the specific, exactly 
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Integration Contours 


Figure 4.11. The original integration contour C, along the real line and the 
deformed contour A, a straight line at an angle 0, for the analytic continuation 


solvable integral of Equation (4.11) to see in detail how the implicit method 
works. Indeed, we can obtain the analytic continuation of a function defined by 
a contour integral, by deforming the integration contour. In our example 


(4.13) 


NI = 


L(a,w) af da RAE A] tes Real(a) < 


corresponds to the integration contour C along the real axis, in Figure 4.11. 
The integration is defined for larg (- (a = $))| < 5. We deform the contour to 
E+A+E’ as in Figure 4.11, the integral is invariant since there are no poles 
in regions R and R’ and if the contributions from the circular arcs EF, E’, vanish 
for infinite radius, which is assumed to be true, we get 


4 2,2 
T(a,w) = Fa (laa) vs ) (4.14) 
z=re9 
T 
2 
we must have that —(a— 4 )e’*’r* has a positive real part. Thus 


But now the integration converges for larg ( (a $)) 40| < 2 since after 


replacing z = re’? 
a deformation of the contour defines an analytic continuation of the integral in 


the parameter a. If we take 0 = 7 then larg ( (a $)) | < 5. This implies 


w(-(-§)G3) a 


hence the integral is now defined for Re{— (a = 5)} <0, which is negative. This 


means Re{a} > $. Thus we define, with A corresponding to the contour with 
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0 =7/4, 
T(a> h = J daa A es) (4.16) 
2 A 
This is an exact expression for the analytic continuation of the original integral 
TI(a< Ew) >T(a> 5,W) , and there is no question as to its existence. However, 
we wish to actually evaluate the integral in the approximation as A > 0 and 
extract only the imaginary part. 


4.5 Extracting the Imaginary Part 


Consider the part of the contour A from 0 to oo in the first quadrant. The 
other half of the contour clearly gives the same contribution. We will calculate 
this integral approximately using the method of steepest descent, which is the 
indicated approximation method in the limit A —> oo. To do this, we further 
deform the contour from its present path between 0 and oo x ef? to the path of 
the steepest descent between these points. As there are no poles in the integrand, 
the integral clearly is invariant under this additional deformation. 


4.5.1 A Little Complex Analysis 


A contour of the steepest descent for the real part of a complex analytic function 
keeps the imaginary part constant (and vice versa). We can easily demonstrate 
this fact. If we have f(x,y) = R(x,y) +iI(a,y) and a curve parametrized by a 
variable t, (a(t), y(#)) with tangent vector (a(t), y(t)), the curve will correspond 
to the steepest descent of the real part R(x, y) if the tangent vector is anti-parallel 
to its gradient, as the gradient points in the direction of maximum change. 
Therefore, 


(0, R(a,y),0,R(a,9)) x EEIE) = Or R(w,y) g(t) — Oy R(w,y)4(t) =0. (4.17) 


Due to analyticity, the Cauchy-Riemann equations give 


thus Equation (4.17) gives 


OyT (x,y) y(t) — (—O21(@,y)) a(t) = © Ten) = 0. (4.19) 


But this means I(t) = constant, demonstrating that the imaginary part of the 
complex analytic function remains constant on the paths of steepest descent of 
the real part. 

In general for an integral of the form 


b 
T= J dzeòf 2) (4.20) 
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we can describe the process of the method of steepest descent as follows. For 
the application of the method of steepest descent, a should be a critical point 
of f(z). We assume Re{ f(a)} > Re{f(b)} and Im{f(a)} > Im{ f(b)} and we 
start from a following the path of steepest descent of the Ne{f(z)} to a’ where 
Re{ f(a’)} = Re{f(b)} and then we append the path of steepest descent of 
Im{f(z)} from a’ to b along a path where now only the imaginary part of f(z) 
changes. If Im{f(a)} < Im{f(b)}, we obviously ascend the appropriate portion. 
In the limit A — oo it is only the first contour which is important, since the second 
is multiplied by e>*¢t/ (a')} K @rMe{ f(a}, Finally we perform the integration over 
the first contour in the Gaussian approximation about z= a. 

There are two further points to be made. First, we are actually only interested 
in the imaginary part of the integral, as it is only this part that we believe will 
have a leading contribution that is non-perturbative in A. Second, and this is 
very important to the first, if the path of steepest descent of the real part of f(z) 
passes through an ordinary critical point of f(z), it abruptly changes direction 
by 90°. We can demonstrate this easily. An ordinary critical point of f(z), which 
requires f’(zo) =0 and assumes f”(zo) #0, implies the behaviour 


FE = Heo) + $F" (20) (2-20)? (4.21) 
Replacing z— zo =x+iy we get 
f(a +a+iy) = f(z) + Toe =y? +2isy) +. (4.22) 


Then paths of steepest descent passing through the critical point are paths of 
the constant imaginary part of f(zọ +x + iy) passing through x = y = 0, i.e. 
Im{ f (zo +x +iy)} =Im{f(z0)}. Therefore, to lowest non-trivial order, we need 
paths with Im{ f” (zo) (x? — y? + 2ixy)} =0. If f” (zo) =r +is this gives 


s(x? — y?) + 2ray = 0. (4.23) 


If s = 0, the solutions are x = 0 or y=0, which are perpendicular horizontal or 
vertical lines, respectively, hence crossing at 90°. Assuming s Æ 0, 


a +2 oyt (2) =p (1+(2)’). (4.24) 


This gives 


(«+ =) = $y S (4.25) 


that is the curves, which are just the straight lines 


sa ( (10) +2) (4.26) 
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with the + signs correlated. The tangents at x = y =0 are given by the directions 


1 
£ (1 + Œ’) J= :) ; i) . These are clearly orthogonal, as their scalar product 


-((+()')'-2) (E +2) 41-0 (4.27) 


Thus in complete generality, the paths of steepest descent turn abruptly by 90° 


vanishes, 


as they pass through an ordinary critical point. 

The real or imaginary parts of complex analytic functions are called harmonic 
functions, which means that they satisfy V? R(x, y) = V?I (x,y) =0. As should be 
well known, all critical points of the real or imaginary parts of complex analytic 
functions are saddle points. Then a path of steepest descent, which descends 
through an ordinary critical point, must change direction by 90°, since continuing 
in the same direction through the critical point would correspond to ascending 
the other side of the saddle. Turning through 90° continues the descent through 
the saddle point. The above analysis shows that for an ordinary critical point, 
f'(zo) =0 but f” (zo) £0, the minimum and maximum directions are at 90° to 
each other. 

For our integral Equation (4.16), the real part of the exponent changes from 
0 — —o as z varies from 0 — oo x e*4,, while the imaginary part of the exponent 
is equal to 0 at z = 0 but becomes arbitrarily large at z = oo x e3. Thus in this 
case, along the path of steepest descent of the real part, the imaginary part of the 
exponent will always be equal to 0, since it must remain constant and it vanishes 
at the initial point. Such a path will reach a point where Re{ f(zo)} = —oo. 
Then further following a contour with fixed real part, equal to —oo, but changing 
imaginary part will be irrelevant since the factor corresponding to the exponential 
of the real part will already be zero. 

Our function actually has three critical points. Indeed, 


“ ((« 5) ua?) =1(a-5) z2 —Qw*z=0 (4.28) 


has the solutions z = 0 and z = ——=“— for the case at hand, a > L. Thus the 


point z = 0 happens also to be a critical point, and it is easy to check that the path 


of steepest descent of the real part from z = 0 proceeds along the positive real 
wW 


The path of steepest descent can be explicitly computed in our special case. 


axis, instead of the contour A, until it reaches the critical point at z = 


and then turns by 90° into the complex plane. 


The condition that the imaginary part be constant and equal to zero gives, with 
z=z%+1y, 


1 
ain { (e — 5) (a? — y? + Qiavy)? -w° (r? —y? + diay) =0. (4.29) 
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Path of Steepest Descent 


Figure 4.12. The integration contour along the path of steepest descent 


Thus 
(4 (a- 5) (2? —y?)— 20") xy =0 (4.30) 


>x=0, or y=0, or (2a—1)(2?-y")=u? (4.31) 


The first two solutions simply describe the x and y axes, the third solution 
corresponds to a hyperbola. Note that all of these curves intersect at 90° as 
we expect. The path of steepest descent, starting at the origin and going out 
to infinity at oo x e’4, corresponds to the curve A’, as depicted in Figure 4.12. 


Asymptotically the arcs of the hyperbola converge to the lines y = +z which is 


the original contour A. The turn by 90° occurs at the critical point at z = z = 
Ww 


4/(2a—1)` 


But now, where does the imaginary part to the integral come from? The 
integrand is always real, and the imaginary part of the function is always 
zero along the contour of steepest descent of the real part. It can only come 


from the integration measure dz when the contour follows the hyperbola in 
Ww 


(20-1) 


axis has no imaginary part, thus we are not interested in it. The integration 


the complex plane. The contribution from z=0 to z= along the real 


along the hyperbola we perform in the Gaussian approximation about the 


ps : zZ w = 2p tw — —_ydy 
critical point at z = Jan We have x = 4/y? + ant)? dx [Pray 
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so dz = dz + idy = | ——*—— + i | dy and the integral is 
y?’ tea 
oS if__w4 —2w2y2-+0ly4 
ti dy To. Px Gana y tol >), (4.32) 
0 


w2 
y ye (a—1) 


Therefore, the imaginary part comes only from the second term, and is given by 


4 

EVT ea, (4.33) 
where the factor of 1/2 in front comes because we are only integrating over half 
the Gaussian peak, while the full Gaussian integral gives Vanh Then for our 
original integral we get 


amt f ar |G a = ee x 2, (4.34) 
— 00 vV2rħ all 2 

where the factor of 2 arrives because we have the integral over the full contour 
of Figure 4.11, whereas the analysis above was only for half of the contour, the 
part in the first quadrant. We point out that the imaginary part of the integral 
simply corresponds to the formal expression of Equation (4.5) with A-1 > |A_1|. 


4.6 Analysis for the General Case 


Now, in the general case, we know what we must do. In order to do the 
path integral, we parametrize the space of all paths which satisfy the required 
boundary conditions for z(a, T = — 8/2) and z(a,t = 6/2) (6 can be effectively 
taken to be oo). We do this parametrization with one special, specific contour 
z(a,T) in the space of all paths, and augmented to this contour, we add the 
subspace of all paths orthogonal to this contour (which we will label as z, (7)). 
To be very clear, a contour is not a path, it is a curve, itself parametrized by a, 
in the space of paths, where each point along the contour corresponds to a path 
z(a,T). The specific contour will contain two critical points 


z(a=0,7T) = 2(T) =0, (4.35) 


which is the “instanton” corresponding to the particle just sitting on top of the 
unstable initial point in Figure 4.2 and never moving, and the point 


z(a =1,7) = z7? (7), (4.36) 


which corresponds to the instanton that we have called the “bounce”. This 
contour is represented pictorially in Figure 4.13 while the corresponding action 
is represented in Figure 4.14. We will see that the actual paths that the contour 
passes through are unimportant except for the two critical points. We also insist 


4.6 Analysis for the General Case 55 


Figure 4.13. The path in function space as a function of a and T 


Sglz(a,T)] 


Il F 
= 


a=0 (04 


Figure 4.14. The Euclidean action as a function of œ 


that the “tangent” to the contour at œa = 1 corresponds to the negative energy 
mode 


d 
Ta 20,7) |. 44 = 2-1 (7). (4.37) 


In this way, the orthogonal directions never contain a negative mode and the 
determinant (path integral over Dz.) can be done in principle. We then write 


the path integral as a nested product of two integrals 
da 
T 


N f Daek | E 


It is important to note that the path integral over the transverse directions is 


Dz (re ESEO], (4.38) 


a-dependent. However, we will find that, since we are actually only interested 
in finding the imaginary part of the full integral, we will need to evaluate this 
transverse integral only at a = 1. 
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Contour for @ 


æ -— plane 


a — complex 


æ -— real 


a=0 


Figure 4.15. The contour for a from the origin, along the real axis and then 
jutting out into the complex plane at 90° at a=1 


Now the integral over œ is, however, ill-defined due to the existence of the 
negative mode at a=1. As a=0 is a critical point which is a local minimum, the 
action increases as we pass from a =0 to a= 1 through real values of a. Hence the 
path in function space is defined as the path of steepest descent of —Sp[z(a,7], 
the exponent (up to the trivial factor of 1/A) in the integral Equation (4.38). But 
then we encounter the second critical point at œ = 1, which is a local maximum 
of the action, again for real a. The action behaves as depicted in Figure 4.14. 
Hence continuing the integral past a= 1 to a= œ, it fails to converge and give 
a sensible answer. However, we are actually only trying to find an imaginary 
component of the original expression. If in fact we could integrate from a = 1 
on to aœ =œ, the expression would remain completely real. Thus we can only be 
content that we must define the integral via analytic continuation, since that is 
the only possible way that the integral could obtain an imaginary component. 

This analytic continuation is expressed as a deformation of the contour of 
integration into the complex a-plane as we saw in the previous section. From 
a= 1 we must follow along the contour of steepest descent of —S'g[z(a,rT)]. The 
important point, as we have seen, is that for an ordinary critical point, which 
is generic and that we assume, this corresponds to a 90° turn into the complex 
plane, as depicted in Figure 4.15. We start at a=0 and go till a=1 on the real 
a line, then we continue out at 90° into the complex a-plane following the line 
of steepest descent of —Sz[z(a,7T)]. 
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As before, the imaginary part only comes from the integration measure; the 
imaginary part of —Sg|z(a,T)] on the path of steepest descent is constant and 
hence always zero. This gives for the imaginary part of the path integral for 
the fluctuations about one bounce (using the notation A.C. to mean “analytic 
continuation”), 
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(4.39) 


where i: Sr[z(a,T)||,—1 =A-1 is the negative eigenvalue and det’ now means 
the determinant is calculated excluding both the zero eigenvalue and the negative 
eigenvalue. In the last line of Equation (4.39), each factor separated by the x 
signs correspond, respectively, to: the factor of one-half since we are integrating 
over only half of the Gaussian peak, the exponential of minus the action of the 
bounce divided by A, the factor corresponding to the Jacobian of the change of 
variables and the factor of 8 when we integrate over the position of the bounce 
rather than its translational zero mode, the factor of one over the square root of 
the magnitude of the negative eigenvalue which is the upshot of our analysis of 
the analytic continuation, and finally the primed determinant over the orthogonal 
directions in the space of paths where the negative mode and the zero mode are 
removed. Taking into account the contribution from the multi-bounce sector, 
the one-bounce contribution, including its imaginary part, just exponentiates as 
before. 

Thus K, as defined in Equation (4.3), changes as K > Re{K}+iJm{K} and 
we find 


Jm{K} = 5 —— gall 2 e : (4.40) 
a |A-1 det £ +a] ' l 


where we now understand the factor of $ as coming from integrating, in the 
Gaussian approximation, over just half of the saddle point descending into the 
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complex a-plane and the primed determinant is now understood to exclude both 
the zero mode and the negative mode. Thus the original matrix element that we 
wish to calculate, Equation (4.1), is obtained from an analytic continuation 


A.C. {e= 7 RMP) |-=0) } =AC. [x 


Leg So \ 4 -$0 
(ra PAE 
= e~B(Eotil’/2)/h A C, { (Eo 0)|? Bier } (4.41) 
where 
es 
i 1 det’ (-4 +V" (ane) 2 
2 |A_1| det (-& +w?) 


K =Re(K)+i (4.42) 


This yields the imaginary part to the energy, il /2, with the width of the state 
nd 
2 


(4.43) 


T- ( So i eh det’ (-& +V" (greeny) 
|A_1| det (-& +w?) 


5 
Quantum Field Theory and the Path Integral 


5.1 Preliminaries 


We will consider the case of a classical scalar field theory and its quantization. 
Later in this book we will consider both vector and spinor fields. A classical 
scalar field $(a) is a real-valued function of the coordinates of space and time. 
The meaning that it is a scalar field is that the value that the function takes is 
invariant under Lorentz transformations. All inertial observers measure the same 
value for the field at a given spacetime point. 


p(x") = p (x) (5.1) 
where 
r” = AH” (5.2) 


with the standard notation z? =t and zt, i= 1,2,- -- d are the spatial coordinates. 
The transformation matrix A#, satisfies 


AEn N = (5.3) 


with diag [n7] = (1,—1,—1,---) the usual Minkowski space metric, which is the 
defining condition for a Lorentz transformation. In general, an equation of motion 
for a classical scalar field is a non-linear partial differential equation. We will 
restrict ourselves to the case of second-order equations, then Lorentz invariance 
dictates the form 


8,0" b(a") +V’ ($(2”))=0. (5.4) 


Written out, this equation is 


(Se - V7) ole") +" o) =o. (5.5) 
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Such an equation comes from the variation of an action, S [ġ(x#)], which is a 
functional of the field ¢(a"), i.e. 

S[o(a) + d6(x*)] — S[e(@*)| 


€ 


=0 
e=0 


V 66(x") = 0,0"9(a") +V’ (b(a")) =0. (5.6) 


55 [9(e")] = 


Then we find the action giving rise to equations of motion, Equation (5.4), is 
given by 


oat] = fate (Saoter)a"o(ar)—v (o(e"))) fee (6.7 


where £ is called the Lagrangian density. The kinetic energy is 


pa [dts (araoa) (5.8) 


while the potential energy is 


l= 5 
v= fats (FH o(ar) Polar) v oa) (5.9) 
which define the Lagrangian as L =T — V and the action is simply 


Solz") = f ar-v). (5.10) 


This defines a dynamical system which is an exact analogy to the particle 
mechanical systems we have been considering in the previous chapters. There 
are just a few simple conceptual changes. The dynamical variable is a function 
of space, which evolves through time. For a mechanical system the variables were 
the positions of particles in space and these positions were evolving through time. 
Now the spatial coordinates xê are not the positions of any particle. They are 
just parameters or labels, and they do not evolve in time. An important point 
to observe is that a dynamical variable which is a function of space, rather than 
a point in space, comprises an infinite number of degrees of freedom, in contra- 
distinction to the case of particle mechanics where we typically consider only a 
finite number of particles. This is easy to make explicit by expanding the scalar 
field in terms of a fixed orthonormal basis of functions on (x$), n =0,1,2,---, 


plat, t) =X en(t)dn(z"). (5.11) 


We can thus exchange the dynamical field ¢(x*,t) for an infinite number of 
dynamical variables {¢n(t)},=0,1,...,06 

This difference is the cause of almost all the problems that arise in the 
quantization of fields. We will proceed with the philosophy that these problems 
correspond to the extreme ultraviolet or infrared degrees of freedom, this 
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philosophy perhaps to be justified only a posteriori. We plead ignorance as 
to what dynamics actually exist at extremely high energies and simply reject 
theories where the answers to questions involving processes at only low energies 
depend on the dynamics at very high energies! Furthermore, we invoke the 
principles of locality and causality, which stated simply means that configurations 
at the other end of the universe cannot affect the local dynamics here. In this way 
we consider only theories which are unaffected by cutting off the infrared degrees 
of freedom. Thus, effectively, we are interested in theories with an enormous but 
actually finite number of degrees of freedom, since we can cut the theory off in 
both the infrared and the ultraviolet. However, this number of degrees of freedom 
is assumed to be so huge that it is well-approximated by ov, so long as that limit 
is sensible. 


5.2 Canonical Quantization 
5.2.1 Canonical Quantization of Particle Mechanics 


The canonical quantization of fields proceeds formally as for particle mechanics. 
First we briefly review how it works for particle mechanics. We find the classical 
canonical variables p; and q;, pj = aE and the Hamiltonian H = $; p,q; — L. The 
equations of motion are: 


di ={u%,h(qj,PK)} 


Di = (pi, hlaj, Pr) } (5.12) 
where {-,-} is the Poisson bracket, 
OAOB OAOB 
eer 3 ðqı Op; p; OG Pe 


Quantization proceeds with the replacement 
{A,B} > -4 [4.8] (5.14) 
yielding, for example, the canonical commutation relations: 
(ĝi, pj] = 1hd;,;. (5.15) 


All dynamical variables become operators, O > O, which act on vectors in a 
Hilbert space. 


5.2.2 Canonical Quantization of Fields 


Applying the above to the case of classical fields, we define the conjugate 
momenta in an analogous way, 


pae JE 
U(x',t) = een (5.16) 
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Then 
H= pare (Me! Ho(e',t) -£) 
Š ps (Fres + sVola't) Vol(ai,t) + V(o(e'.t)) _ (5.17) 


The Poisson bracket is now given by (for local functions of ¢(a*),(a*), we can 
dispense with the functional derivatives and just write partial derivatives, as 
they give the same answer) 


OA OB OA OB 


= d—1 
[ana / FGA) Olas) Olat) Ode on 
which includes the fundamental Poisson brackets 
{o(a*,t), I (a?,t)} = 64" (at - xt). (5.19) 


We impose the same quantization prescription as in the particle mechanics 
case, given by Equation (5.14). This yields the celebrated equal time canonical 
commutation relations 


[bla t) i, t)| = inata —y'), (5.20) 
The (Heisenberg) equations of motion follow from the commutators: 
EE [o(a' t) a (5.21) 
dt 9 ? 9 
a OR a Ap hee 
ihia, t) = [Fe 1), (5.22) 


There is a lot of mathematical subtlety in the definition of the product of 
the quantum field operators of a one-spacetime point which is required in 
the definition of the Lagrangian and Hamiltonian. Indeed, the quantum field 
operators that satisfy Equation (5.20) cannot be simple operators but in fact 
are operator-valued distributions. The operator products required to define 
the Lagrangian and the Hamiltonian are not straightforwardly well-defined. 
Canonical quantization can be made to work reasonably well for the case of 
linear field theories, for example see [107]. 

So far we have been considering the quantization in the Heisenberg picture. 
The variables are dynamical while the states are constant. We can equally well 
consider the quantization in the Schrédinger picture, with the transformation 


Îl, t) > ÎS (xt) = U (HÊ (at, t)Ut (t). (5.23) 


Then we find, 


= =0, (5.24) 
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i.e. the fundamental quantum fields in the Schrödinger picture are time- 
independent, if U(t) satisfies 


aa ‘ 
ih U(t) = HU(t). (5.25) 


The formal solution of this differential equation is U(t) = eitH/h Evidently 
H commutes with U(t). The corresponding transformation of the Hamiltonian 
yields 

AH > AS =U(t)HU' (t) = Ê. (5.26) 


This states that the Hamiltonian for time-independent problems does not depend 
on the representation. If we have an eigenstate of H, 


A |Y) =E|¥) (5.27) 

then 
ASU(t) |v) = UHU (t)U(t) |v) = EU (t) |V). (5.28) 

Thus 
ng (U(t)|V)) =U AY) = AS (U(t) 18) ) =E (U(t) |8) ) (5.29) 


which is just the Schrödinger equation. 


5.3 Quantization via the Path Integral 


Now the path integral for a quantum particle mechanics amplitude in Minkowski 
time, as given by Equation (2.42), yields 


iTh(X.P) S[z(t)] 


(yje |x) =N f page. (5.30) 


This formula was proven assuming nothing of the nature of the space in which x 
and y took their values. Typically they were coordinates in R”, but they could 
have been in any configuration space of unconstrained variables (with constraints 
additional terms can appear [76]). Actually we have 
_iTH (4B) af ; Sla] 
(aple rn Ja) =N Dq(tye h, (5.31) 


di 


where q(t) could be any generalized coordinate, for example, an angular variable 
of a rotator or the radius of a bubble which changes its size. 

Then, for quantum field theory, we simply let q take values in the space of 
configurations of a classical field. This gives 


iri($ i) 


of 
(ipl e) =N f Dalah) 


. S[o(x")] 
areh ni (5.32) 
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The states |¢;) and |f) correspond to a quantum field localized on the 
configurations ;(x") and ¢,(#"), respectively. The states |¢) are directly 
analogous to the states |Z) that we considered earlier in particle quantum 


mechanics. These were eigenstates of the (Schrédinger) position operator X 
XÐ =2|z). (5.33) 
In that respect, the states |¢) are taken to be eigenstates of the field operator 


$S (ai) lp) = 9(2") lo) . (5.34) 


The states |¢) are also improper vectors, as the states |z) were, and true states 
are obtained by smearing with some profile function 


IF) = i DéF(¢)|¢) (5.35) 


where F'(¢) is a functionally square integrable functional. The inner product is 
defined by 


(F|G) = / D6F*(4)G(¢). (5.36) 


We call the Feynman path integral in this case the functional integral. It is a 
rather formal object in Minkowski space, but it can be used to generate the 
usual perturbative expansion of matrix elements, in a rather efficient manner. 
(Its analogue in Euclidean space, which we will use, can be rigorously defined in 


some cases.) 


5.3.1 The Gaussian Functional Integral 


We can essentially perform only one functional integral and that, too, not 
necessarily in closed form. This is the Gaussian functional integral. However, 
if we can do the Gaussian functional integral it is sufficient to generate the 
perturbative expansion. Consider the functional W [J] of some external source 
field J(x") defined by 


W [J] N DT A E S eI) 
oe d d N 
=> aut | aroo dend (a1) Tan )G™ (21, EN), (5.37) 
N=0 
where the integrations are done over all of spacetime and we impose the boundary 


conditions on the field the ¢(a") > 0 as |a"| — oo. Then the so-called N point 
Green functions of the theory are obtained via functional differentiation 


Geman) = (F) (Srey area) Y 


(5.38) 


J=0 
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Correspondingly, W[J] is called the generating functional since it can be used 
to generate all the Green functions of the theory. We will show that the 
GN (x1, ,£&y) corresponds in principle to the matrix elements 


(OJT ($ (a1) ---8¥ (en) 10) (5.39) 


where the state |0) is the eigenstate of the Schrödinger field operator with 
eigenvalue $(x*) =0, i.e. 5(x*)|0) =0. 

For a Hamiltonian that depends on time, HS (t), which is the case here with 
an arbitrary external source J(x”), 


AS (t) = A? + Att (5.40) 
77 =| ds (Site ite" + ISe’) 945 (at) +V (4*¢e4)) (5.41) 


A(t) = I dtg (J (z')) (5.42) 


one can easily prove that the path integral gives rise to 
i _i pT/2 7S 
N f Deeks = jim (y| T (« tS ryz 4H °) |x) (5.43) 


where T(A(t1)B(t2)) = 0(tı — te) A(ti) B(t2) + 0(t2 — t1) B(t2)A(t1), the usual 
time-ordered product. The time-ordered product here yields the limiting value 
of the (infinite) ordered product of infinitesimal unitary time translations over 
each of N infinitesimal time elements, € = T/N between —T/2 and T/2, ordered 
so that the latest time occurs to the left 


. T A 
-i f Zp dtS (t) 
Tle 72 


= lim eiA De iA 9.. eKA OCT e REAP Ft) (5.44) 
N-0o 

The Hamiltonian being time-dependent because of the, in principle, time- 

dependent external source J(a’,t). The derivation of the path integral goes 

through as before by inserting a complete set of states between the infinitesimal 

unitary transformations. (There is a completely analogous expression for the case 

of the Euclidean path integral, where the time-ordering is replaced by Euclidean 


time-ordering, which is sometimes called path-ordering.) Thus we find with 


i ¢T/2 As 
W(J]= Jim wir (e fS rya m) 10) (5.45) 
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then 
—ihd —thd 
ae WIJ = 
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= SENN T RAJt 
ay es Te 


xT (e i Syd ates p E CD 10)| 
for ti >t >- >tyN 

= ot (eiar ansa $3 (i )T (i ASO) 59 8). P(e) 

) io) 

J=0 
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—i S ~ ~ 
= (O/T ($A? O) SH (iT (ah). dH (ah) I0) | 
> (B=0|T ($7 (HF (e) p (eh) 1B =0)| (5.46) 


J=0 


ae g 
x Teck tN tS (t 


J=0 


where we have explicitly written the Heisenberg fields as oH (at) = 
T (et Ji AEE $9 (x*)T (e ESE WATE) while the Schrödinger operators 
are defined with respect to t =—oo. Here |0) still corresponds to the state with 
(x) =0 while the state |E =0) corresponds to the true zero-energy vacuum 
state. However, the last identification in Equation (5.46) requires explanation 
as it is not exactly the same as Equation (5.39). As we will see, once we define 
the functional integral more carefully, instead of computing the matrix element 
in Equation (5.39), the functional integral projects uniformly onto that which 
corresponds to the matrix element in the state of zero energy, the vacuum state. 
At the present juncture the definition of the functional integration is extremely 
formal, and neither the operator-valued matrix element in Equation (5.39) nor 
its functional integral representation exist. 
If we nevertheless continue formally, we find 


=N f DeF t ihs) x 


xeta x(2 Oynd(at tor (zt jt)- im 267 (a? t+ J (a? t)o(a? t) 


—i d ; ô 
erla zv ( ares) WJ), (5.47) 


W?[J] is a Gaussian functional integral, which we can explicitly perform. We use 
the formula, which as written is only formal but becomes valid if defined via an 
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appropriate analytic continuation 


im dx ae dx itl(e-t)? ,-idb(2)6 


=o V2T oo V2 
Sees (5.48) 
—ia 
which generalizes to 
/ d a cig (@A-4) 4206.2) = (det(—iA))~2 eii (A715) (5.49) 
(27)? 


for finite dimensional matrices. Boldly generalizing to the infinite dimensional 
case, for W°[J] we find, with A — — (0,0" +m?) and b— J (and absorbing an 
infinite product of 7’s into the normalization constant), 


N -4 fd%ed4y(J(e) (2| 4 |) J(y)) 
WJ] = e? ~(Ən ðH +m?) (5.50) 
/ det (0,,0# + m?) 
5.38.2 The Propagator 
It only remains to calculate 
(2 ee ) = | Seem (5.51) 
—(0,0" + m2)" ~ J (any kukt — m?’ i 


We seem to be on the right path to defining the functional integral; however, 
we come up against another problem: this Green function is ambiguous. This 
problem is only solved via analytic continuation. In the Fourier representation, 
for example, there are poles in the kg integration at ko = +\/ |k|2 + m2. We 
cannot integrate through the poles, we must provide a prescription for integrating 


around them. Such a prescription translates directly into fixing the asymptotic 
boundary condition on the solutions of the problem, for ¢ 


(0,0% +m?) d= J. (5.52) 


Clearly any solution for ¢ is ambiguous up to a solution of the homogeneous 
equation 


(0,,0" +m”) do = 0. (5.53) 


Correspondingly, the Green function to Equation (5.52) is also ambiguous by the 
addition of an arbitrary solution of the homogeneous equation. The asymptotic 
boundary conditions on ¢ fix the Green function. These boundary conditions are 
equivalent to giving the pole prescription. 
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5.3.3 Analytic Continuation to Euclidean Time 


The existence of homogeneous solutions corresponds to zero modes in the 
operator A = — (0,,0" +m’); hence, the original integral was ill-defined. The 
problem can be traced back to the matrix element 


W[J] = lim (0|T G ASTE eA >) |0) . (5.54) 

The operator in the matrix element can be written, for an arbitrary future time t, 
T (eka) _ eT Sirt B. of Sira (e-t eee) 

=e hat yy 7/2), (5.55) 


Then U(t,—T/2) satisfies the differential equation 


POUCES _ 4 


= Erpat A? print. (poh raw A U 7/2) 


= A" (HU (t,—-T/2) (5.56) 
where 


A z i rt iOa £ i rt 1 70 
Haya fat tesal je tanh $3 (zije J-r/24 H 


= / d*-1yJ(x',t)b! (2*,t) (5.57) 


defines the interaction representation Hamiltonian and the interaction represen- 
tation field ¢! (x,t). The solution of the differential Equation (5.56) is unique 
with boundary condition U(—T/2,-T/2) = 1 and given by 


U(t,-T/2)=T (t Sirit A ba (5.58) 
Thus 
_i pT/2 ard I 
W{J]= lim (Ole fS ryz% H r(e -ASIF ATE ’) jo. (5.59) 
00 


The state |0) corresponds to an eigenstate of the Schrédinger field operator with 
the eigenvalue zero, and is not an energy eigenstate of the Hamiltonian, hence 


= 5° Cz\E) (5.60) 
E 
where 
H°|E) =E|E). (5.61) 
Then the matrix element in Equation (5.59) is given by 
4 $ T( 
= lim X et ETE CCEE (e Eef sae i |E) (5.62) 


T =œ 
E,E' 
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This expression is generally not well-defined. The infinite phases give an ever- 
oscillatory contribution which does not exist in the limit T — oo. We are in fact 
interested in the matrix element and its various moments which give rise to the 
Green functions, as J — 0. Even in this limit, we get that W[J = 0] is ill-defined; 
if any of the Cp £0 for any E £0, then 


WIJ =0] > Se AOE Cpl? (5.63) 
E 
Thus, somehow we must project onto the ground state, defined to have E = 0. 


This would happen if we can add a negative imaginary part to E. Equivalently, 
if we rotate 


tor=-it d’r—> -idr (5.64) 
the action goes to 
1 
Sis? = if ate G (Ə pð p +m?) +V ($) — Jo) ; (5.65) 
and the matrix element is 
(OT (er Ms ae) 0) ~ (E =0|T (Cae ey) IE=0). (5.66) 


|E =0) is the zero-energy vacuum state of the theory with J = 0. Then the 
functional integral gives 


N' f Doe = (E=0|T (e? SA |E =0) (5.67) 


and the Minkowski space functional integral is defined by the analytic 
continuation of this object to real times. 
The rotation t > —77 yields the Euclidean operator (—O, 9, + m?) ġ which has 


no zero modes, 


(—0,,0, +m’) d6=0=> ¢=0. (5.68) 
Thus 
-sE 
N' | Doei 
N’ _ fat v(a 5 ) - patty (ste (lnl) i) 
gn O y x SI (ey e (-3p3u +m ) 
det(—ôp 3u +m?) 
(5.69) 
where 
1 d'k ik, (2- 1 
iku (æ—y)y 5.70 
(z| (ð, ð, +m?) g ic a© (kuk, +m?) (3:70) 


which is now well-defined. 
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The analytic continuation back to Minkowski space (xo — yo) — i(£o — Yo) gives 
the Minkowski Green function with the “correct” Feynman prescription at the 
poles 


leah) = f Sete ay CD 
(00r Fm l T) na (kukt — m2 Fie) l 


Thus once the Minkowski space functional integral is defined via the analytic 
continuation back from Euclidean space, it clearly gives the vacuum expectation 


value 


WIJ] = (E =0]T (e4 ao |E =0) 


4 atk Jk F(—k)_ 
2 (kukt —m?+ic) ‘ (5.72) 


E = fatev( ths pay ) f 


For example, the Feynman propagator is obtained from 
dik e`iku(s—y)" 
(27)? kykt — m? + ic 


Ar (21,22) = (E =0|T (¢(#1)6(x2)) |E = 0) = (5.73) 
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Decay of the False Vacuum 


In this chapter we give the first example of an application of the methods we 
have learned so far. We will apply the methods of instantons to the problem of 
vacuum instability in quantum field theory. We consider a scalar field governed 
by a Lagrangian of the form 


L= | dx 30,0(0)0" (2) Vola). (6.1) 


The potential V (¢(x)) for ¢(a) = ¢, a constant independent of the spacetime 
coordinates, has the form represented by the graph in Figure 6.1. There are two 
minima, a global minimum at ¢_ and a local minimum at ¢ . Classically the 
configurations ¢(x) = ¢4 are stable. The energy is given by the functional 


E= J de 54(2) +194): Vola) + Voa). (6.2) 


When ¢(x) is a constant the first two terms, which are positive semi-definite, 
give zero contribution, thus the energy comes solely from the potential term. The 
potential is minimized and normally adjusted by adding a constant to make it 
vanish at the global minimum ¢() = ¢_, so normally the energy of this classical 
configuration is zero. At ¢(x) = 6, the potential is in a local minimum, however, 
and then the value of the potential is finite and the total energy is divergent. 
The divergence is proportional to the volume. However, the physically important 
quantity is not the total energy but the energy density, which is given directly 
by the potential. Then the energy density difference between the two classical 
ground states is finite. 6, is the false vacuum while ¢_ is the true vacuum. The 
false vacuum is unstable while the true vacuum is stable. 

We will, however, adjust the zero of the potential not in the normal way but 
as depicted in Figure 6.1, by adding a constant, so that the energy density of 
the false vacuum state is zero. Such a redefinition cannot affect the local physics. 
Then we will calculate the decay of the false vacuum to the true vacuum per 
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Figure 6.1. The potential giving rise to a false vacuum 


unit time and per unit volume, L. We will find an expression of the form 

T B 

y = Ae~* (1+ 0(A)) (6.3) 
in the semi-classical limit. This form is exactly that which we have seen for 
decays via tunnelling. B will correspond to the classical action for a critical 
configuration while A will come from the quantum considerations. We proceed 
in an analogous fashion to the problem we considered in quantum mechanics. 
We wish to define the analytic continuation of the matrix element 


A.C{(b4 |e |b4)} (6.4) 


from a potential for which the vacuum constructed at + is stable to the potential 
we are considering. As we have seen, the analytic continuation instructs us on how 
to deal with Gaussian integrals over fluctuations about a critical configuration 
which correspond to negative frequencies. 


6.1 The Bounce Instanton Solution 


Otherwise we proceed in the usual way with the semi-classical analysis of the 
Euclidean functional integral. We look at 


N J Doe- Ee (6.5) 
with the boundary conditions ¢ (7 = +4) = $4. Here 
Selo) = f dda (53,0(0)9,0(0) +V(6(e)) (6.6) 
with the equation of motion corresponding to 
SSe _ _ 9,5, 6(0) +V Eeo: (6.7) 


õp 
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Here we use the Euclidean metric. This equation is exactly the equation of motion 
for the scalar field in minus the potential. We take the boundary conditions for 
the case 8 = 00 


and we add the condition 
0,6(7,T =0) =0, (6.9) 


which determines the Euclidean time at the classical turning point. This time of 
the classical turning point is completely at our disposal for the case 6 = oo. The 
condition that the classical action should be finite gives 


lim $(#@,7) = $4. (6.10) 


|Z] 00 


We assume a form that is O(4)-invariant 


1 
o(@,7) = 4 (02? +7°)2). (6.11) 
The equation of motion becomes, with p = (|Z|? + 72)3 
d? 3d 
—56+-—$-V"(¢) =0. (6.12) 
dp pdp 


The action is ? 
ec 1/d 
Spld]=2n° f ap (; ($) ivo) (6.13) 


with the boundary conditions lo =0 and lim,.. (p) = +. The first 


condition avoids a singularity at p = 0 while the second comprises all of the 
asymptotic boundary conditions. 

A rigorous proof of the existence of a solution and that it is the minimum action 
solution is given by Coleman, Glaser and Martin [34], but we shall be content 
with the following argument due to Coleman [31]. The equation of motion (6.12) 
can be interpreted as that for a particle with “position” ọ moving in “time” p. 
The particle is subject to a force, —V’(¢), and a frictional force with a “time’- 
dependent Stokes coefficient of friction z: The equation of motion for a particle 
in a potential with Stokes coefficient of friction u is 

d? d ; 
apr) Ha, ole) + V'(e(p)) = 0. (6.14) 


The solution in the absence of a potential, V’(¢(p)) =0, is simply ¢(p) =a — be™”’ 
for arbitrary constants a,b, with a related to the initial position and b related 
to the initial velocity. This solution confirms that motion with friction without 
external forces will come to rest exponentially fast. In the present case depends 
on p. 
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Figure 6.2. The reversed potential and effective dynamical problem 


We can prove the existence of a solution satisfying our boundary conditions by 
the following continuity argument. We must show that there exists an initial point 
Qo from which the particle can start at p = 0 and achieve = $4 at p =œ. The 
potential is reversed to give —V(@) as depicted in Figure 6.2, and ¢ is defined 
as the point at which the potential crosses zero. If ġo > ¢1 , (p) will never 
reach ¢4 even as p— oo starting with zero velocity. If, however, ø- < ġo < 41, 
and po is sufficiently close to ¢_, (p) will surpass p+} at some finite time. We 
can understand this intuitively; if ¢9 is arbitrarily close to ¢_, the particle will 
roll off this potential hill arbitrarily slowly. We can make this time so long that 
the coefficient of friction, A becomes negligibly small. Then the particle will roll 
off and eventually climb the hill at ¢; and even surpass $+ since it is now a 
conservative system. Indeed, for ¢(p) close to ¢_ we can linearize the equation 
of motion, 


d2 3d 2 — 
(F042 Fou ) (0) -9-)=0. (6.15) 


which has the solution 


(6.16) 


where w? is V”(¢_), and (wp) is the modified Bessel function of the first 
kind. This implies that for (¢(0) — ¢_) sufficiently small, (@(~0) — @_) can be 
kept arbitrarily small such that (po) < ¢1 so that the potential energy remains 
positive, where po is determined by the condition that the subsequent energy 
lost to the friction term is negligible. Once the friction term becomes negligible, 
the system is conservative and, since at po the potential energy is positive, the 
particle will clearly surpass ¢, at finite pf. A measure of the energy lost in the 
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friction is obtained by the integral 
f 


” 3d 3 [f d 3 a 3 
[wi ce< =f aġoxž | d=Ž-6). (617 
Po pap po Po P Po bd po 


Thus we choose po large enough so that this energy is negligible in comparison 
to the energy scales that drive the dynamics, say V (0): 


3 
~ ($+ -¢-) <V(0). (6.18) 
PO 

Then finally we conclude that there must exist some intermediate ġo from which 
0(p) will attain d, exactly as p— oo. This implies the existence of a solution of 
the form we desire. 


6.2 The Thin-Wall Approximation 


We can go much further with the assumption that the energy density difference 
between the two vacua is small. 


5 ($-a) (6.19) 


with U(¢) = U(—¢), U' (+a) = 0, U” (+a) = w? and € is arbitrarily small, as 
depicted in Figure 6.3. We can calculate the action for the bounce to first order 


in e. The reversed potential is given in Figure 6.4. At p= 0 the field is very close 
to —a, it stays there for a very long “time”, and then it rolls relatively quickly 
through the minimum of the reversed potential, up to the hill at ¢ = +a since 
now the friction is negligible. It achieves 6 = +a only as p > oo. The bounce is 
like a large four-ball of radius R, in Euclidean space, of true vacuum, separated 
by a thin wall, from the false vacuum without. 
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Figure 6.3. The symmetric potential with a small asymmetry 
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Figure 6.4. The reversed symmetric potential with a small asymmetry 


For p near R, if we drop the friction term we obtain the equation of motion 
(to zero order in €) 


d2 
gp? V=. (6.20) 


This is exactly the same equation that we have studied in the double-well problem 
of Chapter (3). The instanton solution interpolates from one well to the other as 
in Figure 3.3. It is given in this region, which is near the wall, approximately by 
the equation 


(p) do j 
p-R= c. 6.21 
o  yv2U($(p)) 
For large |p — R], the solution is given by 
bp) =+ (a = gg viel . (6.22) 
For example, for the choice of the potential 
À 2 2\2 
U(~)=F (¢° —a?) (6.23) 
the solution is 
$(p) = atanh (w (p— R)) (6.24) 
with a = 2a and w? = 2)a?. 
Thus our bounce is given by 
—a O0<p<R 
Pbounce(P) = olp) p% R : (6.25) 


a p>R 
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To find R we do a variational calculation in R. 


R-A R+A 
1 
Sp [Pbounce] = r? dpp*(—e) + an? f dpp? z 
0 


~ -3n Rie + 20 R? S1, for R>A, (6.26) 


where Sı is the action for the one-dimensional instanton ¢(p) calculated in 
Equation (3.27) which is independent of R (we call it Sı here to emphasize 
that it is the one-dimensional instanton action), 


Cae T ae (iio) +U (4) ) 


—oCo 


z / deU. (6.27) 
Sg(R) should be stationary under variations of R, 
aen i —2r° R’e+ 67r? R° Sı =0 (6.28) 
dR 
hence 39 
R=, (6.29) 


€ 
This confirms our expectation that R —> œ as e > oo. Finally, the Euclidean 
action for the bounce is 


1 4 3 2 204 
gouies = ae (=) e+2n? (=) S= TT Si (1 + o(€)3) ; (6.30) 
€ € 


2€ 


6.3 The Fluctuation Determinant 


The calculation of the coefficient A of Equation (6.3) is not so straightforward, 
even approximately. It is given by the determinant of the operator governing 
small fluctuations about the bounce. 


bounce 


, 2 
TA l4) =e7 ET N det”? (=E -VPV (Ooonee)) . (6.31) 
T 


When we attempt to evaluate the determinant we encounter the same 
problems that we have already seen in particle quantum mechanics: non-positive 
frequencies for the spectrum of Gaussian fluctuations. 

Zero modes come from invariance of the action under translations. We can 
translate in space and Euclidean time which gives us four independent zero 
modes (we write @pounce aS by for the sake of brevity) 


o 
PulT,T) =N>— 


san (2,7). (6.32) 
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Zero modes correspond to continuous degeneracies of the critical point of the 
Euclidean action. Here they correspond to the arbitrariness of the location of 
the centre of the bounce in Euclidean R^ which is actually R. We cannot 
integrate over these directions in the integrations over fluctuations about the 
bounce; however, we can equivalently integrate over the position of the bounce 
in Rf which is actually R. This gives a (divergent) factor of BV and a Jacobian 
corresponding to the change of integration variable from the fluctuation degree 
of freedom to the coordinate giving the position of the bounce. The Jacobian 
factor is of the same type as before, indeed, 


ie N da grle- #0)" )de" (6.33) 


for an infinitesimal change dc, of o coefficient of the Gaussian fluctuation along 
the normalized zero-mode direction, + Bae? ((a—20)”), while 


o v 
õp = gar ((a—2x0)”) dz}. (6.34) 
Equating the variation in Equations (6.33) and (6.34) gives 
dc” N 
= d 6.35 
V 20h 27h a ( ) 
Now 
1 ə vw O Op 
[tesa gape C-e) gapou (a —20)") = Fits f dellade). 


(6.36) 
We can evaluate this integral by using the fact that the action Sz is stationary 
at the bounce. 


d 
0= = Se (Ar) 


“al (5 (Ou.bo(Ax) 2,dv(e)) +V (one) ) 


A=1 A=1 
= fate (525 Oroa D) 
-f ite (- 24 (aupa) Bole) -4V (6nle))) 
EA E J dx (Onbv(0)On60(x)). (6.37) 
Hence 
J tx (Q.0(0)2,6s(0)) = 48 (dole) (6.38) 
and finally 
Se [do(x)| (6.39) 


exactly as in the one-dimensional case. The Jacobian factor becomes 


4 
( ofp ) , giving the integration over the position of the bounce 


(Sz [es()))” 


ee (6.40) 
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We do the same analysis for N well-separated bounces, which are approximate 
critical points, which gives us 


(Se lN (BV) 
CO 


where the N! simply indicates that the permutations of the positions of the 
bounces do not give new configurations. This gives 


BH 1 -(6 (TER) elec? e) 
lole T |b4) =Ndet 2 (—0,0, +V” (b4))e an?h l 
(6.42) 

where K is now the ratio 


a (3p ðu +V” (Ø) ) -3 
det (—3 3u + V” (b+) 


and the prime indicates that the zero modes are removed. The normalization 


(6.43) 


constant M is defined to exactly cancel the free determinant that appears 
Ndet™? (-0,8, +V" (¢4)) =1 (6.44) 
This is, not the whole story, because the operator 
— 8,9. +V” (bs) (6.45) 


has a negative mode. Again our analysis of meta-stable states in quantum 
mechanics applies directly. Taking into account the factor of 4 which comes 
from the analytic continuation and deformation of the contour, we find 


Eoi (ssi) awa 
det (—3 ð, +V"($4)) l 


(6.46) 


V 4RR 
The prime still indicates that only the zero modes are removed, the square root 
of the negative eigenvalue reproduces the imaginary nature and the factor of i 
is taken into account because the lifetime is i of the imaginary part. Analysis of 
the negative modes is left for Section 6.5. 


6.4 The Fate of the False Vacuum Continued 


We continue our analysis of the decay of the false vacuum by considering 
the evolution of the field after the tunnelling event. We can obtain some 
intuition from the WKB analysis of tunnelling in particle quantum mechanics. 
Consider the decay of a nucleus by a-particle emission. A reasonably successful 
phenomenological potential has the form of a square well of depth extending to 
less than zero attached to a short-range drop off potential from the top reaching 
to zero, as depicted in Figure 6.5. The negative energy levels in the well are stable, 
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Figure 6.5. A nuclear tunnelling potential 


but the positive energy levels are meta-stable and decay by tunnelling. The semi- 
classical description of the decay process proceeds as follows. The particle stays 
in the well up to a time, the “transition time”, which is a random variable, when 
it makes a quantum jump to the other side of the barrier. It appears suddenly 
at the other side at a point, which we call the “tunnelling out point”, with the 
same energy as the meta-stable state within. Subsequently, it continues like a 
free classical particle until it eventually moves off to infinity. 

Quantum mechanics only enters in the calculation of the process of barrier 
penetration. It allows us to calculate the mean value of the “transition time”. 
In the WKB analysis, the tunnelling out point is the point on the other side 
of the barrier with equal energy to the energy of the meta-stable state inside, 
from which, if the particle were released, it would move off to infinity under 
the classical dynamics. This is the turning point in the usual WKB analysis. 
We identify this point as the point where all velocities are zero in the bounce 
solution. We choose this point by the condition 


8, (B,7)|pao =0- (6.47) 


This is satisfied by the O(4) symmetric ansatz that we have taken, 


=0. (6.48) 
T=0 


3- (P)l-=0 = 3p% (P) (z) 


The field appears at 7 = 0 in the state described by ¢,(#,7 = 0) and then evolves 
classically. The WKB analysis should not be taken too literally. It will not be 
accurate for observations made just after the tunnelling event occurs. It is more 
correctly an asymptotic description for what happens long after and far away 
from the tunnelling event. 


6.4.1 Minkowski Evolution After the Tunnelling 


We continue nevertheless with the initial condition for after the tunnelling event 


é(#,t=0)=¢,(#,7=0), 076(2,T)|,p =0 (6.49) 
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and then the field evolves according to the classical, now Minkowskian, equation 
of motion, 
d? 

(v Jeev (ġ(z,t))=0. (6.50) 
At t=0, 6(Z,t = 0) = ø (7,T =0) is exactly a bubble of radius R of true vacuum, 
separated by a thin wall from the false vacuum without. This is because ¢)(Z,7) = 
do(/|€? +77) > (r) for t =0 with r = |z|. We can immediately write down 
the solution to the classical Minkowskian equation of motion for the subsequent 
evolution of the bubble. Simply 


6(8,t) = (VP-P). (6.51) 


In detail for the Minkowskian signature, with p= /|Z? —? = y-t", 
BIAB) = ( Aaa) 
= gpa i 4 papas. (6.52) 
Using 0,p = —+, 0,p0"p = —1 and hence 0,0") = E we get 


2 


3,3" ol) = — (= 


3 5 
Fi =) 6(A). (6.53) 


The Euclidean equation satisfied by p ę /\x|? +7?) is 


(gtv) on (VERSA) -v (on (VEPFA))=0. 659 


Since 
2,2,0(0)= (Fat =) on) (6.55) 
gives He 6 
(a+ 5) e-v e=. (6.56) 
P p 
Thus 


(D-7 ) a (VEP =P) +V' (o,(Vi@P-#))=0 (657) 


and it should be noted that this solution is only valid for |Z|? > ¢?, i.e. for the 
exterior of the bubble. 

Then the O(4) invariance of the Euclidean solution is replaced by the O(3,1) 
invariance of the Minkowskian regime. This implies that the evolution of the 
bubble appears the same to all Lorentz observers. When the bubble is nucleated, 
the wall of the bubble is at r ~ R, and then it follows the hyperbola, 6? = 
r? —t? = R?. This is because the functional form of 4(() describes the wall for 


all p= R?. This means that the bubble grows with a speed which approaches 
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Figure 6.7. Collision warning time with the growth of the bubble wall 


the speed of light asymptotically, as depicted in Figure 6.6. How quickly the 
growth approaches c depends on R. If R is a microscopic number, like 10719 > 
1078? as we would expect, the bubble grows with the speed of light almost 
instantaneously. If a bubble is coming towards us, the warning time we have is 
given by the projection of the forward light cone from the creation point to our 
world line (vertical), as depicted in Figure 6.7. The time this gives us in warning, 
T, is essentially the time it takes light to travel the distance R, as long as the 
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observer is far from the creation point relative to R. For R micro-physical, T 
is also microphysical. After the bubble hits us, quoting directly from Coleman 
[31]: “We are dead. All constants of nature inside the bubble are different. We 
cannot function biologically or even chemically”. But, paraphrasing, as further 
pointed out by Coleman, this is no cause for concern, since for R~ 10715 metres, 
T ~3x 1078 seconds, this is much less time than the time it takes for a single 
neuron to fire. If such a bubble is coming towards us, we won’t know what hit us. 


6.4.2 Energetics 


The energy carried by the wall of the bubble is exactly all the energy gained by 
converting a sphere of radius R of false vacuum into true vacuum. The energy 
in the wall per unit area is 


e= ae | Eeli) +v) 
= f. ie G (So) +V(6s)) 
z F dr G (To) +v) =i; (6.58) 


—0O 


Now, in time, the wall follows the hyperbola r? — t? = R?, hence the energy in 
the wall always stays in the wall. After some time, the element of area will have 
a velocity v. Energy per unit area just transforms as the zero component of a 
Lorentz vector, 


Sı 
Sı > ——.. 6.59 
t y1 mee 
So at such a time the energy in the wall is 
S 
E =4nr? — (6.60) 


Vv1l—v? 


with 


_ Ene (3) = ae (6.62) 


(In the thin-wall approximation, we have R = 351 ) This is exactly the energy 
obtained from the conversion of a ball of radius r of false vacuum into true 
vacuum. Hence all the energy goes into the wall. Inside the bubble is just the 
tranquil, true vacuum. There is no boiling, roiling, hot plasma of excitations. 
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6.5 Technical Details 


We complete this chapter with some technical points which we have left 
unaddressed. 


6.5.1 Exactly One Negative Mode 


We have assumed that there was exactly one negative energy mode to the 
operator governing small fluctuations 


We can prove this in the thin-wall approximation. O(4) invariance means that 
we can expand in the scalar spherical harmonics in four dimensions 


1 
on,j (p,Q) as a Xn ij (p) Yj mm! (Q), (6.64) 
p2 
where Y; m,m (Q) transforms according to the representation D2! of SO(4) = 
SO(3) x SO(3), with m and m’ independently going from —j to j. These are 
the eigenfunctions of the transverse Laplacian in four dimensions. Then to zero 
order in e€, 


(a EE Luna) xn O =Angrns(P) (668) 


for the resulting radial equation. This is analogous to the Schrödinger equation 


for a particle in a radial potential in three dimensions. 


The zero modes 1 


J—— 9 hp) 
Spouse 


transform according to the j = i representation. (4 + 4 =1+0 for the three- 


(6.66) 


dimensional rotation subgroup.) Since ¢»(p) is an increasing function, it starts 
at ġ— and increases to ¢4 at p =œ, the zero modes have no nodes. Hence they 
are the modes of lowest “energy” for j = 5. For j > i the Hamiltonian is simply 
greater than for j = $, hence all modes have energy greater than zero. Thus 
the negative modes can only arise in the sector with j = 0. There must be at 
least one negative mode since the Hamiltonian is simply smaller for j = 0. In the 
thin-wall limit, U” (¢y(p)) has the form given in Figure 6.8 where w? = U” (4). 
This is because ¢,(p) starts at d_ at p =0 and stays so until about p = R where 


it interpolates relatively quickly to ¢1, and then stays essentially constant until 
p=oo. The zero modes, corresponding to derivatives of œ (p), hence have support 
localized at the wall. The negative energy modes must also be localized there. 
Thus we approximate the equation near p + R by replacing in the centrifugal 
term p —> R. This yields the equation 


( oe t n ay! (on(o))) Xn,j (P) = Àn,jXn,j5 (P). (6.67) 
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U"(Oy(P)) 


Figure 6.8. The potential for the small fluctuations about a thin-wall bubble 


Clearly 


8il +1) +3 
Anj = An + — ZR 


with An, ordered to be increasing with n, evidently independent of 7. For R — ov, 


(6.68) 


An are the eigenvalues of the one-dimensional operator 
d? 

(Etro), (6.69) 
where f(x) = (x) with x € [—-00,00], i.e. we can neglect the effect of the 
boundary at p = 0. We already know that for j = ł the minimum eigenvalue 
is zero, thus 


8iG+1) +3] _ 83:5+3__ 9 
ào > m | m a (6.70) 
2 
This gives 
9 3 3 
= | = 71 
A0.0 =~ ape + GR 2R2’ ie) 


which is negative. All other eigenvalues for j = $ are positive, for all R. This 
implies that all the other An are greater than zero, since 


lim (à ee Ba = lim (\n)>0 f 0 6.72 
OOF ae a ee A 
Thus also for 7 =0 
3 
An + 4R2 > 0, for n> 0, (6.73) 


for R large, hence there are no other negative eigenvalues. 

In the limit e > 0 we obtain the double-well potential depicted in Figure 6.9. 
There are no bounce-type solutions for this potential. Our solution just becomes 
a ball of true vacuum of infinite radius, R = 381 — oo. There exist only the 
solutions 


o=b- or b=44 (6.74) 


86 Decay of the False Vacuum 


oC) 


$- $+ 


Figure 6.9. The symmetric double-well potential U(¢) 


to the Euclidean equation of motion. This is different from the case of particle 
quantum mechanics, where there are tunnelling-type solutions between the two 
wells. This difference is completely consistent with our understanding of quantum 
field theory in a potential with two symmetric wells of the same depth. In such 
a theory there is spontaneous symmetry breaking. The two vacua, constructed 
above each well, correspond to inequivalent representations of the quantum field. 
They cannot exist in the same Hilbert space, and hence there is no tunnelling 
between them. 


6.5.2 Fluctuation Determinant and Renormalization 
The determinant that we must compute is 


k = det (3,3, +V” (bp) = elm (dee(— ut V"(bs))) — etrn(=3u 3u +V" (H), 


(6.75) 
We expand about ¢= ¢1, then V” (61) + w?, then we have 
k= ett In(—Op Ou tw? +(V"(b4)—-w?)) : 
— ettin(( OpOu+w?) (14+ (-,Opt+w?) ” (V""(by)-w?))) 
= ottln((—0ndp tw?) torn (1+(—Opdn+6?)”"(V!"(bs)-w?))) 
af (— we) (yn ee ae we) 7p" yy ORN 
EP. (( Dp Outw?)*(V!"(by)—w?)—E ((-3u ðu tw?) (V) -w?)) + lew 


where xo = det (-d,0, +a’). The free determinant will be absorbed in the 
definition of the factor K = (eje a of Equation (6.43). 

The first two terms in this expansion are infinite; however, all the rest are finite. 
V” (œb) — w? is exponentially small for p >> R, so we may Fourier transform it to 
obtain 


= 4r TA 
Fy) = f a (V), (6.77) 
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Figure 6.10. Feynman diagram for the first term in the expansion of 
Equation (6.76) 


f(k,) its Fourier transform is then also exponentially small for large k,,. Then 
2\ T1 fyn 2 
tr ((-3u ða +)! (V" (ds) =?) 
1 
= | tteay(el -yg pa DOV" (6s) -Ple 
dh f iku( Tu—yYp) 
= | zdy la tape (V" (gs (a)) —w*) 5(a —y) 
dk ie i ethu(@p-Ypn) etdpty _ 

= [E f a u fe ae ()8(e—) 

= f SE, fata freemen 


= > Pe (/ ra wat i) (6.78) 


The integral over dtk is divergent, and can be represented by the diagram given 
in Figure 6.10. The infinity arising here must be absorbed via a non-trivial 
renormalization of the theory. The next term is 


tr G ((-2, ee We) ) 
= 3 | za'ylel =y E | ga ra) 
(V” (de(y)) — w?) (V" (dole )) =w?) 


=5 fas dt kaa etkn (2—y)u tilu (y—2)utiquyu tipupu ia aie 
(278 (k? +w?) (7? + w?) j 


= 3 | rii dfl 2 Fu) (2 E] (6.79) 


where integrating over x and y obtains two delta functions in momentum, and 
then integrating over k and q eliminates these two variables. The integrals can 
be represented diagrammatically as depicted in Figure 6.11. The integration over 
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Figure 6.11. Feynman diagram for the second term in the expansion of 
Equation (6.76) 


Figure 6.12. General Feynman diagram of the expansion of Equation (6.76) 


l is divergent and also requires a non-trivial renormalization of the theory. 
In general we get a diagram of the form given in Figure 6.12. It corresponds 
to the integral 


fa [E lpi +p2 + +pN)f (Prp) e FPN) 
D +u?) (py)? +w?) (1 DL api)?+w2) 
(6.80) 
It is only the integration over | which can cause problems, the f(pj,,) are 
exponentially decreasing for p;,,— oo. For three or more insertions the integral 
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is finite 


fa : ~ J A (6.81) 

(P +w?) (l= pi)? +w?) (0 - (pi +p2))? +?) P 

The solution of the problem of how to remove the divergences is by adding 
a set of (an infinite number of) counter-terms to the action, which will cancel 
the infinities arising from the integrations. It is a property of a renormalizable 
field theory that all such counter-terms can be reabsorbed into a multiplicative 
redefinition of the coupling constants and fields of the original theory. This means 
that the counter-terms correspond to terms which are of the same form as those 


already present. 


Sbare (¢) = 5 ($) + hs? ($) Teg (6.82) 


where S? (¢) is finite, but S1 ($) is not and the higher terms are not. This implies 
a change in the bounce, which will also be of the form 


by = of thpt, (6.83) 
where oF is the same function as ¢, but now of the renormalized parameters. 
Now 

R 1 R(4R (ary, 09" () 1 2 
Sbare (ot +o +)=5 (oF) +AS (of) + od laho +o(h ) 
= SF (of) + AS! (of) + 0(h?), (6.84) 


where the third term in the first equality vanishes by the equations of motion. 
Then 


Nin 


P (SP (68)? _sP(oR)+nst(oh)+- (det! (—0? +V?" (68) ) ae 
VY mre | det (—02 + V2" (g#)) ( . ) 


with the stipulation that 


Nile 


ast (op) (det! (0? +V?" (68) 
: i det (82 +V?” (¢R)) 


(6.86) 


be finite. We choose $1(¢,) so that we cancel the two divergent terms in the 
expansion of the determinant. This can be made even clearer by ensuring that 
the bare action to o(ħ) vanish at the renormalized unstable vacuum value ¢%. 
This requires 

SE (9%) +AS! (7) =hS' (9%) =0 (6.87) 
since by definition SË (6%) = 0. We can achieve this by subtracting the constant 
ASt (pX) from the bare action in Equation (6.82), giving 


Stare (P) = SF (¢) +A (S! (A) — St (o2)) +. (6.88) 
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This change implies the condition that 


Nine 


n(s!(oB)=s1 (of) (det (—0? +" (68) 
e det (—8? + V2" (6%)) 


(6.89) 


be free of infinities. We see that one factor of counter-terms matches with each 
determinant, ensuring the independent renormalizability. 

In a renormalizable theory, such as $+ theory, it is possible to prove that 
it can be done keeping $1(¢) of the same form as Sjare(¢). In the general 
case, it is clear that the infinities can be cancelled; however, it is not clear 
that it can be done keeping the same functional form of the bare Lagrangian. 
Continuing the perturbative expansion of the functional integral beyond the 
Gaussian approximation will yield higher loop corrections and infinities, for 
which it will be necessary to add further counter-terms, written as h?.9?(¢)+--- 
These again, for a renormalizable theory will be of the same form as the bare 
Lagrangian. We will not belabour the point any further. 

One final avenue for controlling the determinant is to decompose it into angular 
momentum eigen-sectors using 


— 3 +V" (bp) = EV” (be (0) (6.90) 


dp? 


in the angular momentum j sector. The multiplicity of the spherical harmonics 
of order j is (2j +1)?. Then 


x -82 +v""(4p) (2j+1)? 
det! (- +y” (a) 7 Diot, trln ee Ta? —counter terms 
det (82 +w?) l 


(6.91) 
Each term is a one-dimensional determinant which we know in principle how to 
calculate. It is finite. The infinities reappear after the summation over j. 


6.6 Gravitational Corrections: Coleman—De Luccia 


In this section we will consider gravitational corrections to vacuum decay. This 
is eminently reasonable as the application of these methods will be to situations 
where gravity is important, such as the evolution of the universe, where we invoke 
Lorentz invariance. The relevance of gravitational effects to vacuum decay in 
condensed matter systems may not be so important. However, in cosmological 
applications, the consideration of gravitational effects is clearly indicated. This 
analysis was first done by Coleman and De Luccia [33], and we will follow their 
presentation closely. 
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Figure 6.13. The potential with a small asymmetry 


For simplicity, we consider a single scalar field with the Euclidean action 


Sxld] = I de (Ca) 4y o) (6.92) 


which is valid with the absence of gravity. The potential V() will be as in 
Figure 6.13, with a true minimum at ¢_ and a false minimum at +; however, 
we will not assume that the potential is symmetric under reflection ¢ > —¢. 
We will further assume that the value of the potential at each minimum is very 
small, proportional to a parameter e. Thus 


V ($) = Vo(4) + o(€), (6.93) 


where Vo(d+) = 0. 
Adding gravitational corrections may seem pointless at microscopic scales, but 


for other scales they can be very important. Indeed, if a bubble of radius A of false 
vacuum is converted to a true vacuum, an energy in the amount F = e47A3/3 
will be released, and this energy will gravitate in the usual Newton—Einstein 
fashion. The Schwarzschild radius of the gravitating energy will be 2GE. This 
radius will be equal to the radius of the bubble when A = 2GE = 2Gc4rA3/3. 
This gives 

A = (8nGe/3) 71”. (6.94) 


For energy densities of the order of e ~ (1 GeV)‘ this gives a radius of about 0.8 
kilometres. Thus the gravitational effects of vacuum decay occur at scales which 
are neither microscopic nor cosmological, but right in the scales of planetary and 
terrestrial physics. It might well be that gravitational effects in vacuum decay 
are very relevant. 
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Adding the gravitational interaction, the action changes to 


Sele.gu)= f devote | deya(50""0,00,0+V(0)+ eR), 

(6.95) 
where guy is the spacetime metric, g”” its inverse, g is the determinant of the 
metric and R is the curvature scalar. We note that in Euclidean spacetime, 
the determinant of the metric g is positive. Adjusting the zero of the potential 
V(¢) + V(¢) — Vo, Vo a constant, corresponds to adding \/gVo to the action, 
which is exactly the same as modifying or adding a cosmological constant. 
Thus the gravitational spacetime inside the bubble and outside the bubble will 
necessarily be quite different, with different values of the cosmological constant. 
This makes perfect sense with our understanding that gravitation is sensitive to 
and couples to the total energy in a system, including the vacuum energy density. 
Thus we have to specify the cosmological constant of our initial false vacuum, 
of which we are going to compute the decay. The cosmological constant being 
exceptionally small at the present time, we will consider two cases of potential 
interest. First we will consider the possibility that we are living in a false vacuum 
with zero cosmological constant and this false vacuum decays to a true vacuum 
of negative cosmological constant, i.e. V (+) =0. Second, we will consider that 
a false vacuum with a finite, positive cosmological constant decays to the true 
vacuum without cosmological constant where we live, i.e. V(¢_) =0. 


6.6.1 Gravitational Bounce 


We assume that the bounce in the presence of gravity will have maximal 
symmetry, O(4) symmetry. The metric, remember that we are now in Euclidean 
spacetime, then must be of the form 


ds” = dé” + p(€)?d0?, (6.96) 


where dQ? is the metric on the three-sphere 9°, and € is the Euclidean 
radial coordinate and corresponds to the proper radial distance along a radial 
trajectory. p(€) is the radius of curvature of each concentric 9° that foliate the 
space. dQ? can be expressed in a number of coordinates, for example the analogue 
of spherical polar coordinates in R*, or in a more sophisticated manner in terms 
of left invariant 1-forms on the group manifold of SU (2) which is exactly S°. But 
we will not need this part of the metric explicitly and hence we will not exhibit 
it, as we will assume everything is spherically symmetric and hence independent 
of the angular degrees of freedom. 

We can then compute the Euclidean equations of motion. These for the scalar 
field are 


On (V9g"” Ov) — VIV’ ($) = 0. (6.97) 
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Using the rather simple form for the metric and the assumption that our field ¢ 
does not depend on the angular coordinates, we find the equation simplifies to 


de (\/99** ed) — VIV’ ($) = 0. (6.98) 


Furthermore, g** = 1 and \/g = p? (E) yga where go is the determinant of the 
metric of the angular coordinates, which is just the metric on a unit three-sphere. 
ga depends explicitly on the angular coordinates but it does not depend on €. 
Since the only derivative that appears in the equation of motion is with respect 
to €, gq simply factors out of both terms and then can be cancelled. This gives 


0 = ô (Edep) — PEV O) 
= P (€)02¢ + 3p? (E)Oc ped — PEV (e). (6.99) 


Dividing through by p° yields 
30 
32$ + ded =V'(¢). (6.100) 


This field equation is augmented by the Einstein equation Guy = —87GT,,,. The 
sign in this equation is convention-dependent, corresponding to the definition of 
the curvature tensor, the signature of the metric and the definition of the Ricci 
tensor. We will use the sign convention in Coleman—De Luccia [33], which is not 
our favourite convention, but we will stick with it to be close to the original 
paper. The Einstein equation yields only one net equation, 


Gee = —87GTee. (6.101) 


The other components, which are just the diagonal spatial components, are either 
trivial identities or equivalent to this equation. The energy momentum tensor of 
the scalar field is 

Tav = upr — Juv LE. (6.102) 
To obtain the Einstein equation, Equation (6.101), one has to compute the 
Ricci curvature through the Christoffel symbols, which is straightforward but 
somewhat tedious. We will not spell out the details here; with the use of symbolic 
manipulation software, the calculation is actually trivial. We find that there is 
only one independent equation, 


(dep)? =1+ srao G (3gp)? — vo) : (6.103) 


It makes perfect sense that there are only two independent equations of motion, 
as there are only two independent fields, p and ¢. The two equations of motion 
can be obtained from an effective one-dimensional Euclidean action 


Sploo)=2n? fae (0° (ZOP vO) + ga (Poola) ) 
(6.104) 


PE 
8rG 
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The equation of motion for ¢ is straightforward, that for p appears only after 
self-consistently using the derivative of Equation (6.103) to eliminate the second- 
order derivative in its usual equation of motion. 

We solve Equation (6.100) in the approximation that the first derivative term 
is negligible, and the assumption that the potential term can be approximated 
by a function V(¢) = Vo(¢) + o(e) with the condition that V’(¢.) = 0 and 
Vo(¢+) = Vo(¢_). This latter assumption is very reasonable if the actual potential 


is obtained from a small perturbation of a degenerate double-well potential. We 
do not assume that the double well is symmetric, however, just that the minima 
have the same value for the potential. Then Equation (6.100) becomes 


deg = Vo ($), (6.105) 


which admits an immediate first integral 
1 
5 (9)? = Volo) +0, (6.106) 


where C is the integration constant. C is determined by the value of Vo at $4, 
as we are looking for a solution that interpolates from @_ at the initial value of 
€, which is normally taken to be zero, to ¢4 as € + oo. Thus 


5 (060)? = Vol) — Vol+). (6.107) 


This equation can be easily integrated as 
$ é g 
| doy 3(Vo—Volbz)) = f a= 8 -€, (6.108) 
(o4+¢—)/2 3 


where € is the value at which the field is mid-way between ¢, and ¢_, which 
can be taken as the position of the wall. In principle, then, we should solve for 
@ which is implicitly defined by this equation. This will not be done explicitly 
and, continuing implicitly, once we have ¢, we can solve Equation (6.103) for 
p. To solve this first-order differential equation requires the specification of one 
integration constant, we choose that as 


P= pl§), (6.109) 


which is the radius of curvature of the wall. We do not need to have ¢ or p 
explicitly, if all we want is the value of the action for the bounce. This will depend 
on p; however, we can determine J by imposing that the action be stationary 
with respect to variations of J. 

We start with the Euclidean action, Equation (6.104), and integrate by parts 
on the two-derivative term to bring it all in terms of single derivatives. We will 
only be calculating the action relative to its value for the false vacuum, thus it is 
calculated in a limiting fashion as the difference of two terms which separately 
do not make sense and diverge in principle, but the difference is finite. Thus the 
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surface term is irrelevant as we will do the same to the action without the bounce 
instanton with just the false vacuum. This gives 


Sp aan? | ag G (50 | v) = (pp? 4 o) (6.110) 


and then we eliminate p’ with Equation (6.103). This gives the rather compact 


expression 


3 127? 87G 
SE = an? | ag (ov a RRT dtp (1- PY). (6.111) 


Now we use the thin-wall approximation, i.e. we assume that the bounce 
instanton will be much like the same in the absence of gravity, and for e — 0, it will 
be of the form of a thin-wall bubble. We will justify the thin-wall approximation 
after the analysis. Outside the bubble the bounce configuration is entirely in the 
false vacuum and we are comparing the bounce action to the action of exactly 
the false vacuum, thus the contribution to the action is zero 


SE, outside — 0. (6.112) 
Within the wall, we can put p = p, and V > Vo up to o(e) terms, giving 
Sp, wan = 40° p” | dE (Vo($) — Vo(o+)) = 27° S1, (6.113) 


where Sı was defined by Equation (6.27) in the absence of gravity. Finally, 


for the inside of the bubble, @ = ¢+ is a constant, for both cases when we are 
computing the action for the bounce or for the false vacuum, thus we have from 
Equation (6.103) 


(6.114) 


A —1/2 
iia (1 z PV (2)) | 


Thus choosing ¢_ for the bounce and ¢, for the false vacuum we have 


12? f? 87G 5 ne 87G 5 sa 
SE inside = “aG J, pdp ((:- z P v(6-)) = (1- “gf vé) 

Oo 12 1 8TG _» se 

~ (rcp Ces ((: PVG) =i 


1 87G _» 3/2 
zl 3 PV (64) -1)) (6.115) 


SE = SE, outside + SE, wall + SE, inside- (6.116) 


where, 


This yields an unwieldy expression; however, for the cases which interest us, it 
is quite simple. 
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Firstly, for the case 64 = €,ġ— =0, the case where we are living in a spacetime 
after the formation of a bubble, we have the simple expression (after taking the 
limit V(¢_) — 0 in the action SE, inside) 


127? 1 81G _, \*/? 
Sp =2n’pS,4 am ( A4nGp” ((2 5 pe) —1} |. (6.117) 


Then setting the derivative with respect to J to vanish, gives 


dSp 227 127? 81G _, \ 
—— =0=6 S 1+1 ; 6.118 
dp wp Sits oP 3 Pe ( ) 
which is easily solved as 
125; Po 


J= = 6.119 
p 4e+240GS? 1+ (50/24) ( ) 


where fo = 3S1/¢, which is the bubble radius in the absence of gravity, and 
A= /3/(87Ge), the radius at which the Schwarzschild radius of the energy 
from converting a false vacuum to a true vacuum is equal to the bubble radius 
as defined in Equation (6.94). Evaluating the action at the value of p yields 


1 277° St So 
s a z os 
(1+ (0/2A)?) “© (1+ @o/2A)?) 


where S% is the action of the bounce in the absence of gravity. We can obtain 


Sp= 


(6.120) 


this formula by brute force replacement for p; however, we can minimize the 
algebra by noting the Euclidean action, as a function of p, has the form 


Sp = ap -pP +7-6(1-Cp)” (6.121) 


with a = 2r? S1, B = 37/4G, y = 3/16G7e, 6 = 8mGe/3 and p = (3S1/6)/( + 
(fo /2A)*) and the above definitions of jp and A. The action is stationary at p 
hence 


Bap? — 26p — 36 (1— Cp?) 


Then factoring out by 3, multiplying by J and adding and subtracting terms we 


(—Cp) =0. (6.122) 


can reconstruct Sg 


3 (an! BP +y-4(1 cp)” (1p) +51 op?)? = 


3 
(6.123) 
so then we get 
> _9)1/2 

Sp = — ÉP -6(1-CP") i (6.124) 

From the derivative, Equation (6.122), we can easily find 

9 _ 

(ip) ee ee (6.125) 


a ee 4 
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and then we have 3 28 z 
-2 ap 
= | 12 


which now is straightforward to evaluate, yielding Equation (6.120). 
For the second case, V(¢1) = 0, V(¢_) = —e, where we are now living in a 
false vacuum that may decay at any moment, we obtain with similar algebra 


a Po 
i= 


ait 6.127 
(5/24)? men 


while 
Sh 


(1-02a 


For the thin-wall approximation to be valid, we required that the radius of the 


Se= (6.128) 


bubble was much larger than the length scale over which ¢ changed significantly. 
The friction term, (3/p)(d¢/dp), was neglected in Equation (6.12) as the factor 
(3/p) ~ (3/p) + 0. Now in the presence of gravitation we have a different friction 
term, (30¢p)/p), which is given by Equation (6.103) 


1 (@p\* 1 , 8G [1 (do 
eles G v). on 


The first term is the same as without gravity and small if J is large. The second 


term vanishes on one side of the wall, is constant and of o(€) on the other, and 
over the wall it interpolates between these two values. From Equation (6.107) 
it is to lowest order a constant, —Vo(¢;), which in our two cases is of o(e) 
plus corrections which are also of o(¢). Hence we lose nothing by replacing 
it with e. This turns the second term into 1/A?. Hence the two terms which 
control the size of > (ay are negligible, justifying self-consistently the thin- 
wall approximation, if p and A are large compared to the variation of ¢. The 
variation of ¢ is from $4 to ¢_, over the thickness of the wall. This thickness 
is determined by the masses and coupling constants that are in Vo which are 
not taken to be remarkable, i.e. neither very large nor very small. Thus the wall 
thickness will be independent of € and hence the variation of ¢ is of o(1). Thus 
self-consistently, for small €, we can impose that J and A are large compared to 
the variation of ¢, and the thin-wall approximation is justified. It is important 
to note that this puts no constraint on fo/A which governs the difference in the 
solutions Equations (6.119), (6.120), (6.127), (6.128) with gravitation and those 
without, Equations (6.29), (6.30), for ð and Sg above. Thus fo/A can be taken 
as large as we want. Although this may not be phenomenologically relevant, it 
is interesting to consider the possibility. 

In the first case with ¢1 =€,¢_ =0 we see that the effect of gravitation is to 
increase the probability of vacuum decay, as the denominator in Equation (6.120) 
is greater than 1 and hence reduces Sg. Gravitation also diminishes the bubble 
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radius. For the second case, V(¢,) = 0, V(d_) = —e, the effects of gravitation 
are in the opposite direction, making it harder for the vacuum to decay as 
the denominator in Equation (6.128) is less than 1 and can even vanish, hence 
increasing Sp to arbitrarily large values. In this case, the bubble radius is 
increased by gravity, in the limiting case, pushing it to infinite radius at a finite 
value of jio/A = Sı 247G Je. Thus for fixed S4 and e but for increasing G, we 
reach a point when the bubble has infinite radius and its action is infinite, 
completely suppressing vacuum decay. Thus gravitation totally suppresses 
vacuum decay for po = 2A, which means 


€ = 6nGS}. (6.130) 


An explanation of the quenching of vacuum decay is because of energy 
conservation. If we calculate the energy of a bubble of radius ø first in the 
absence of gravitation, we have the volume term and the surface term 


4 
B=- ep? +405," (6.131) 


In this (second) case of interest, V(¢1) =0, V(d_) = —e, thus we are living in 
a false vacuum of zero-energy density and the true vacuum has negative energy 
density. Then using the expression Jo = 3S1 /¢ we have 
E= ur (Po — P), (6.132) 

thus we see that the energy vanishes for the bubble, which is expected as the 
energy before the bubble materialized was zero. Then the effects of gravitation 
can be taken into account, imposing energy conservation. If gravitation increases 
the total energy of the bubble, then the bubble must grow in size to compensate 
and if the gravitation decreases the energy it must shrink. In the case at hand, 
evidently the bubble must grow. 

The gravitational contribution to the energy has two terms. First, the ordinary 
Newtonian potential energy, which is computed by integrating the gravitational 
field squared over all space 


=o 
emp) 
ENewton = TCs (6.133) 


This follows from the straightforward calculation of the Newtonian energy of the 
gravitational field inside a sphere with negative mass density —e. That energy is 


1 
ENewton FR 5 | Peo), (6.134) 
where the gravitational potential satisfies 
V?@(z) = 47G (—e). (6.135) 


Then the energy is given by 


Ba 
ENewton = Jei N = 87 =a feaa , (6.136) 
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where g= —V@ is the gravitational field. Applying Gauss’ law to 


V-g=—4nG(-) (6.137) 
yields 
, 4rG , 
g=— e (6.138) 


for the interior of the bubble. The gravitational field vanishes in the exterior. The 
integral Equation (6.136) quickly yields the result, Equation (6.133). The second 
contribution comes because the existence of the energy distorts the geometry 
correcting the volume of the bubble and hence correcting the volume term in the 
energy. From Equation (6.114) we can write the volume element of the bubble 
2 2 1? 2 
4r p dE = 4r p dp (1-35) +0(G*). (6.139) 


Then integrating over the bubble, the energy density —e yields a change 


2rEpe 


Egeom = ane (6.140) 
giving a total change 
=5 
_ TEPO 
Egrav = 3A2- (6.141) 


Thus the change in energy is positive, which means that, with gravitation, 
the radius of the bubble must increase. It appears that for finite values of 
the couplings and parameters, when Jo = 2A, the bubble size becomes infinite. 
Increasing the gravitational coupling then gives no solution, t.e. the false vacuum 
becomes stable. 

Once the bubble has materialized through quantum tunnelling, we can 
describe its subsequent evolution essentially classically. For Minkowski space-like 
separated points with respect to the centre of the bubble, all we have to do is 
analytically continue the solution back to Minkowski time. Thus for flat space we 
had p? =#-%+77 > #-#—t?. However, we must continue both the solution and 
the metric back to Minkowski time. Thus an O(4)-invariant Euclidean manifold 
becomes a O(3, 1)-invariant Minkowskian manifold. The metric starts as 


ds? = —d&* — (p(€))*dO?, (6.142) 


the negative definite metric being chosen as we wish to continue to a metric 
of signature (+,—,—,—) where dQ? becomes the metric on a unit hyperboloid 
with space-like normal once continued to Minkowski spacetime. For this region 
$= $(p) the solution that we have implicitly assumed to exist (although we have 
not been required to find it explicitly) and for a thin wall, the bubble wall is 
always at p = / and lies in this region. If we are outside a materializing bubble, 
then this is all we have to know about the manifold. It is possible to describe 
further the evolution of the bubble for the two cases that we have considered; 
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however, we will not continue the discussion further, it no longer requires the 
methods of instantons. We recommend the reader to consult the original article 
of Coleman and De Luccia [33]. 


6.7 Induced Vacuum Decay 


We continue our study of the decay of the false vacuum precipitated by the 
existence of topological defects in that vacuum [79, 85]; we restrict our attention 
to the example of the decay of a “false cosmic string”. Such a topological soliton 
corresponds to a topologically stable, non-trivial configuration inside a spacetime 
that is in the false vacuum. We will not worry about gravitational corrections. 
Topological solitons exist when the vacuum is degenerate and, generically, we 
have spontaneous symmetry breaking. 


6.7.1 Cosmic String Decay 


Cosmic strings occur in a spontaneously broken U(1) gauge theory, a generalized 
Abelian Higgs model [61]. This model contains a complex scalar field interacting 
with an Abelian gauge field, hence scalar electrodynamics. However, we consider 
the inverse from the usual case, the potential for the complex scalar field ¢, has a 
local minimum at a non-zero value ¢? = a?, where the symmetry is broken, while 
the true minimum occurs at vanishing scalar field, ¢=0. The scalar field potential 
is considered an effective potential, we do not worry about renormalizability. 
We assume the energy density splitting between the false vacuum and the 
true vacuum is very small. The spontaneously broken vacuum is the false 
vacuum. 

In a scenario where from a high-temperature, unbroken symmetry phase the 
theory passes through an intermediate phase of spontaneous symmetry breaking, 
it is generic that there will be topological defects trapped in the symmetry-broken 
vacuum. Furthermore, the system could be trapped in the spontaneously broken 
phase, even though, as the temperature cools, the true vacuum returns to the 
unbroken symmetry phase. For the complex scalar field, its phase e*”, can wrap 
the origin an integer number of times so that A@ = 2n7, as we go around a 
given line in three-dimensional space. The line can be infinite or form a closed 
loop. Corresponding to the given line there must exist a line of zeros of the 
scalar field, where the scalar field vanishes and corresponds to the true vacuum. 
The corresponding minimum energy configuration (when the roles of the false 
vacuum and true vacuum are reversed) is called a cosmic string, alternatively 
a Nielsen—Olesen string [96] or a vortex string [3]. In the scenario that we have 
described, the true vacuum lies at the regions of vanishing scalar field, thus the 
interior of the cosmic string is in the true vacuum while the exterior is in the 
false vacuum. It is already interesting that such classically stable configurations 
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actually exist. Such strings must be unstable to quantum mechanical tunnelling 
decay. In this section we show how to calculate the amplitude for this decay, in 
the thin-wall limit. 

In [85], the decay of vortices in the strictly two-spatial-dimensional context 
was considered. There, the vortex was classically stable at a given radius Ro. 
Through quantum tunnelling, the vortex could tunnel to a larger vortex of radius 
Rı, which was no longer classically stable. Dynamically the interior of the vortex 
was at the true vacuum, thus energetically lower by the energy density splitting 
multiplied by the area of the vortex. The vacuum energy behaves as ~ —e€R?, 
while the magnetic field energy behaves like ~ 1/R? and the energy in the wall 
behaves like ~ R. Thus the energy functional has the form 


E=a/R?+6R-eR’. (6.143) 


For sufficiently small e€, this energy functional is dominated by the first two 
terms. It is infinitely high for a small radius due to the magnetic energy, and 
will diminish to a local minimum when the linear wall energy begins to become 
important. This occurs at a radius Ro, well before the quadratic area energy, due 
to the energy splitting between the false vacuum and the true vacuum becoming 
important, when e€ is sufficiently small. Clearly, though, for large enough radius of 
the thin-wall string configuration, the energy splitting will be the most important 
term, and a thin-walled vortex configuration of sufficiently large radius will be 
unstable to expanding to infinite radius. However, a vortex of radius Ro will 
be classically stable and only susceptible to decay via quantum tunnelling. The 
amplitude for such tunnelling, in the semi-classical approximation in the strictly 
two-dimensional context, has been calculated in [85]. 

Here we consider the model in a 3+1-dimensional setting. The vortex can be 
continued along the third additional dimension as a string, called a cosmic string. 
The interior of the string contains a large magnetic flux distributed over a region 
of the true vacuum. It is separated by a thin wall from the outside, where the 
scalar field is in the false vacuum. The analysis of the decay of two-dimensional 
vortices cannot directly apply to the decay of the cosmic string, as the cosmic 
string must maintain continuity along its length. Thus the radius of the string at 
a given position cannot spontaneously make the quantum tunnelling transition 
to the larger iso-energetic radius, called R,, as it is continuously connected to 
the rest of the string. The whole string could, in principle, spontaneously tunnel 
to the fat string along its whole length, but the probability of such a transition is 
strictly zero for an infinite string, and correspondingly small for a closed string 
loop. Here we will describe the tunnelling transition to a state that corresponds 
to a spontaneously formed bulge in the putatively unstable thin cosmic 
string. 
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6.7.2 Energetics and Dynamics of the Thin, False String 


6.7.2.1 Set-up We consider the Abelian Higgs model (spontaneously broken 
scalar electrodynamics) with a modified scalar potential as in [85] but now 
generalized to 3+1 dimensions. The Lagrangian density of the model has the 
form 


L= -IFE + (D6) (D9) -V($"9), (6.144) 


where Fuy =0, A, — 0L Ap, and D o = (3, +eA,)¢. The potential is a sixth-order 
polynomial in ¢ [79, 111], written 


VE) =A? — ev”) (|? — v”)?. (6.145) 


Note that the Lagrangian is no longer renormalizable in 3+1 dimensions; 
however, the understanding is that it is an effective theory obtained from a 
well-defined renormalizable fundamental Lagrangian. The fields ¢ and A,, the 
vacuum expectation value v have mass dimension 1, the charge e is dimensionless 
and A has mass dimension 2 since it is the coupling constant of the sixth- 
order scalar potential. The potential energy density of the false vacuum || = v 
vanishes, while that of the true vacuum has V(0) = —Av®e. We rescale as 


pvp AvA, e> Ave z> x/(v? A?) (6.146) 


so that all fields, constants and the spacetime coordinates become dimensionless, 
then the Lagrangian density is still given by Equation (6.144) where now the 
potential is 

Veo) = (dl? -ele - 1). (6.147) 
and there is an overall factor of 1/(Av?) in the action. 

Initially, the cosmic string will be independent of z, the coordinate along its 
length, and will correspond to a tube of radius R with a trapped magnetic flux in 
the true vacuum inside, separated by a thin wall from the false vacuum outside. 
R will vary in Euclidean time 7 and in z to yield an instanton solution. Thus 
we promote R to a field R — R(z,7). Hence we will look for axially symmetric 
solutions for ¢ and A,, in cylindrical coordinates (r, 0, z, T). We use the following 
ansatz for a vortex of winding number n: 

ner; 


(7,0,2,7) = f(r,Rz,7))e™", — Ai(r,0,2,7) = 72 


a(r,R(z,T)), (6.148) 


where €” is the two-dimensional Levi-Civita symbol. This ansatz is somewhat 
simplistic; it is clear that if the radius of the cosmic string swells out at some 
range of z, the magnetic flux will dilute and hence through the (Euclidean) 
Maxwell’s equations some “electric” fields will be generated. In three-dimensional, 
source-free Euclidean electrodynamics, there is no distinct electric field, the 
Maxwell equations simply say that the three-dimensional magnetic field is 
a divergence-free and rotation-free vector field that satisfies superconductor 
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boundary conditions at the location of the wall. It is clear that the correct form 
of the electromagnetic fields will not simply be a diluted magnetic field that 
always points along the length of the cosmic string as with our ansatz; however, 
the correction will not give a major contribution and we will neglect it. Indeed, 
the induced fields will always be smaller by a power of 1/c? when the usual units 
are used. 

The Euclidean action functional for the cosmic string then has the form 


Sp[A,,¢] = xa |e 2 o K 
+ (0,6)* (Org) + (0.9) +2 Dil) Dd) +V(¢"¢)] (6.149) 


where i, j take values just over the two transverse directions and we have already 
incorporated that Ag = A3 = 0. 
Substituting Equations (6.147) and (6.148) into Equation (6.149), we obtain 


212 2(Əa)2 . 
saa lee arr| oot Sa T E ogy 


2e?r2 2e?r2 


5a- Pera", (6.150) 


where the dot and prime denote differentiation with respect to 7 and z, 
respectively. Then å = (2) R and a = = (26) R', and likewise for f, 
hence the action becomes 


Sp= 2% f azar f arr n (22) A) (AH) R) n? (Opa)? 
0 


2e?r2 2e?r2 2e?r2 


(as CRP = (C2) Joo 


53,3 + (Orff) + Gl oP PHP or 1), (6.151) 


We note the two- (Euclidean) dimensional, rotationally invariant form (R? + R?) 
which appears in the kinetic term. This allows us to make the O(2) symmetric 
ansatz for the instanton, and the easy continuation of the solution to Minkowski 
time, to a relativistically invariant O(1,1) solution, once the tunnelling transition 
has been completed. 
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In the thin-wall limit, the Euclidean action can be evaluated essentially 
analytically, up to corrections which are smaller by at least one power of 1/R. 
The method of evaluation is identical to that in [85] and we shall not give the 
details here; we get 


Sp = a | PrZ MRT UP +R?) + E(R(z,7)) — E(Ro), (6.152) 


where 
rn? 
M(R)= RP +R (6.153) 
n? p? > 


® is the total magnetic flux and Rọ is the classically stable thin tube string 
radius. 


6.7.3 Instantons and the Bulge 


6.7.3.1 Tunnelling Instanton We look for an instanton solution that is O(2) 
symmetric. The appropriate ansatz is 


R(z,7T) = R(V 22 +7?) = R(p) (6.155) 


with the imposed boundary condition that R(co) = Ro. It is useful to understand 
what this ansatz means. We expect that the solution will be localized in 
Euclidean two space, say around the origin. Far from the origin, the solution 
will be R = Ro. Thus if we go to T = —ov, the string will be in its dormant, 
thin state, all at R = Ro. As Euclidean time progresses, at some Euclidean time 
T = — R; a small bulge, an increase in the radius, will start to form at z = 0. This 
bulge will then increase dramatically, until at 7 = 0 it will be distributed over a 
region of the original string of length 2R1, the factor of 2 because the radius of the 
O(2) symmetric bubble is Rı in both directions. Then the bubble will “bounce” 
back and shrink and the string will return to its original radius. An alternative 
description is in terms of the creation of a soliton—anti-soliton pair. The instanton 
solution will describe the transition from a string of radius Rọ at T =—oo, toa 
point in 7 = — R; at z=0 when a soliton—anti-soliton pair starts to be created. 
The configuration then develops a bulge which forms when the pair separates to 
a radius which again has to be R, because of O(2) invariance and which is the 
bounce point of the instanton along the z-axis at 7 = 0. Finally the subsequent 
Euclidean time evolution continues in a manner which is just the (Euclidean) 
time reversal of evolution leading up to the bounce point configuration, until a 
simple cosmic string of radius Ro is re-established for 7 > R and all z, i.e. for 
p= Rı. The action functional is given by 

1 (=e 


LMR) w) +E(R(p))—E(Ro)|. (6.156) 
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Figure 6.14. The energy as a function of R, for n = 100,e = .005 and e= .001 
The instanton equation of motion is 
d dR\ 1 dR\* 
— | pM(R)— | —=pM'(R)| — | —pE'(R)=0 6.157 
£ (mE) -ewm (E) — pet (6.157) 


with the boundary condition that R(oo) = Ro, and we look for a solution 
that has R ~ Rı near p = 0, where Rı is the large radius for which the 
string is approximately iso-energetic with the string of radius Ro. The solution 
necessarily “bounces” at T = 0 since OR(p)/OT|-=0 = R’(p)(T/p)|r=0 = 0. (The 
potential singularity at p =0 is not there since a smooth configuration requires 
R'(p)|p-0 = 0.) 

The equation of motion is better cast as an essentially conservative, dynamical 
system with a “time’-dependent mass and the potential given by the inversion of 
the energy function as pictured in Figure 6.14, but in the presence of a “time”- 
dependent friction where p plays the role of time: 


2 
a (ey) — i'r) (=) _ BI(R) =—~ (eZ) (6.158) 
dp dp 2 dp p dp 
As the equation is “time’-dependent, there is no analytic trick to evaluating 
the bounce configuration and the corresponding action. The solution must be 
found numerically, which starts with a given R ~ Rı at p = 0 and achieves R= 
Ro for p > po. We can be confident of the existence of a solution by showing 
the existence of an initial condition that gives an overshoot and another initial 
condition that gives an undershoot, as pioneered by Coleman [32, 23]. If we start 
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at the origin at p =0 high enough on the far right side of the (inverted) energy 
functional pictured in Figure 6.14, the equation of motion, Equation (6.158), will 
cause the radius R to slide down the potential and then roll up the hill towards 
R= Rp. If we start too far up to the right, we will roll over the maximum 
at R = Ro, while if we do not start high enough we will never make it to the 
top of the hill at R = Ro. The right-hand side of Equation (6.158) acts as a 
“time’-dependent friction, which becomes negligible as p —> oo, and once it is 
negligible, the motion is effectively conservative. It is not unrealistic to believe 
that there will be a correct initial point that will give exactly the solution that 
we desire, that as p— 00, R(p) —> Ro. We find the solution exists using numerical 
integration. For the parameter choice n = 100,e = .005 and e = .001, if we start at 
R~ 11,506.4096, we generate the profile function R(p) in Figure 6.15. Actually, 
numerically integrating to p% 80,000 the function falls back to the minimum of 
the inverted energy functional Equation (6.14). On the other hand, if we increase 
the starting point by .0001, the numerical solution overshoots the maximum 
at R= Ro. Hence we have numerically implemented the overshoot /undershoot 
criterion of [32, 23]. 

The cosmic string emerges with a bulge described by the function numerically 
evaluated and represented in Figure 6.15 which corresponds to R(z,r =0). A 
three-dimensional depiction of the bounce point is given in Figure 6.16. One 
should imagine the radius R(z) along the cosmic string to be Ro to the left, 
then bulging out to the the large radius as described by the mirror image of the 
function in Figure 6.15 and then returning to Ro according to the function in 
Figure 6.15. This radius function has argument p = Vz? +72. Due to the Lorentz 
invariance of the original action, the subsequent Minkowski time evolution is 
given by R(p) > R(Vz? — #?), which is only valid for 2? —t? > 0. Fixed p? = 2? — t? 
describes a space-like hyperbola that asymptotes to the light cone. The value of 
the function R(p) therefore remains constant along this hyperbola. This means 
that the point at which the string has attained the large radius moves away 
from z+ 0 to z => œ at essentially the speed of light. The other side moves 
towards z + —oo. Thus the soliton—anti-soliton pair separates quickly, moving 
at essentially the speed of light, leaving behind a fat cosmic string, which is 
subsequently classically unstable to expand and fill all space. 

The rate at which the classical fat string expands depends on the actual value 
of e. Once the string radius is large enough, its boundary wall is completely 
analogous to a domain wall that separates a true vacuum from a false vacuum. 
The true vacuum exerts a constant pressure on the wall, and it accelerates 
into the region of false vacuum. Obviously, if there is nothing to retard its 
expansion, it will accelerate to move at a velocity that eventually approaches 
the speed of light. The only effects retarding the velocity increase are the inertia 
and possible radiation. Radiation should be negligible as there are no massless 
fields in the exterior and there are no accelerating charges. The acceleration, 


6.7 Induced Vacuum Decay 107 


10000 


8000 


6000 


4000 


2000F —- Ry=1292.18 


ğe a a gg ee i 
0 5000 10000 15 000 20 000 25 000 


Figure 6.15. The radius as a function of p 


a, is proportional to pressure divided by the mass per unit area. The pressure 
is simply the energy density difference, p = e. The mass per unit area can be 
obtained from Equation (6.153). Here the contribution to the mass per unit 
length from the wall is simply 7R. Thus the mass per unit area, u, is obtained 
from 7R x L= w27R x L for a given length L, which gives u = 1/2. Then we 
have 

at eju = 2e. (6.159) 


Thus it is clear that this acceleration can be arbitrarily small, for small €, and 
it is possible to imagine that once the tunnelling transition has occurred the fat 
cosmic string will exist and be identifiable for a long time. 


6.7.4 Tunnelling Amplitude 


It is difficult to say too much about the tunnelling amplitude or the decay rate per 
unit volume analytically in the parameters of the model. The numerical solution 
we have obtained for some rather uninspired choices of the parameters gives rise 
to the profile of the instanton given in Figure 6.15. This numerical solution could 
then be inserted into the Euclidean action to determine its numerical value, call 
it So(€). It seems difficult to extract any analytical dependence on ¢; however, 
it is reasonable to expect that as e > 0 the tunnelling barrier, as can be seen in 
Figure 6.14, will get bigger and bigger and hence the tunnelling amplitude will 
vanish. On the other hand, there should exist a limiting value, call it €e, where 
the tunnelling barrier disappears at the so-called dissociation point [126, 81, 80], 
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Figure 6.16. (a) Cosmic string profile at the bounce point. (b) Cut away of 
the cosmic string profile at bounce point 


such that as € > €,, the action of the instanton will vanish, analogous to what 
was found in [85]. In general, the decay rate per unit length of the cosmic string 
will be of the form 


So(€) —So(e) 
c.s. ole a 
T=A (=e e (6.160) 
where ASS: is the determinantal factor excluding the zero modes and (32) is 


the correction obtained after taking into account the two zero modes of the bulge 
instanton. These correspond to invariance under Euclidean time translation and 
spatial translation along the cosmic string [32, 23]. In general, there will be a 
length L of cosmic string per volume L°. For a second-order phase transition 
to the meta-stable vacuum, L is the correlation length at the temperature of 
the transition which satisfies L~' ~ \v?T, [70, 69, 130, 129]. For first-order 
transitions, it is not clear what the density of cosmic strings will be. We will 
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keep L as a parameter, but we expect that it is microscopic. Then in a large 


volume Q, we will have a total length NL of cosmic string, where N = Q/L?. 
Thus the decay rate for the volume Q will be 


Ix (NE)=° (=) = Acs. (29) e7500) È (6.161) 


or the decay rate per unit volume will be 


Trx(NL) T Aa 2r 
no = (6.162) 


A comparable calculation with point-like defects [85] would give a decay rate per 
unit volume of the form 


[point like 


eee 3/2 
if 7 point like int lik 
Apoint like Ge) ae ikee) 


p = T3 (6.163) 
and the corresponding decay rate from vacuum bubbles (without topological 
defects) [32, 23] would be 


vac. bubble 2 
pvac: bubble _— qvac. bubble (2 : pel 2) esr (e) A (6.164) 
T 


Since the length scale L is expected to be microscopic, we would then find that 
the number of defects in a macroscopic volume (i.e. universe) could be incredibly 
large, suggesting that the decay rate from topological defects would dominate 
over the decay rate obtained from simple vacuum bubbles [32, 23]. Of course the 
details depend on the actual values of the Euclidean action and the determinantal 
factor that is obtained in each case. 

There are many instances where the vacuum can be meta-stable. The 
symmetry-broken vacuum can be meta-stable. Such solutions for the vacuum 
can be important for cosmology, and for the case of supersymmetry breaking see 
[1, 47] and the many references therein. In string cosmology, the inflationary 
scenario that has been obtained in [67] also gives rise to a vacuum that is 
meta-stable, and it must necessarily be long-lived to have cosmological relevance. 

In a condensed matter context, symmetry-breaking ground states are also of 
great importance. For example, there are two types of superconductors [7]. The 
cosmic string is called a vortex-line solution in this context, and it is relevant to 
type II superconductors. The vortex line contains an unbroken symmetry region 
that carries a net magnetic flux, surrounded by a region of broken symmetry. 
If the temperature is raised, the true vacuum becomes the unbroken vacuum, 
and it is possible that the system exists in a superheated state where the false 
vacuum is meta-stable [41]. This technique has actually been used to construct 
detectors for particle physics [11, 105]. Our analysis might even describe the 
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decay of vortex lines in superfluid liquid Helium III [86]. The decay of all of 
these meta-stable states could be described through the tunnelling transition 
mediated by instantons in the manner that we have computed. For appropriate 
limiting values of the parameters, for example when € > €e, the suppression of 
tunnelling is absent, and the existence of vortex lines or cosmic strings could 
cause the decay of the meta-stable vacuum without bound. 


7 
Large Orders in Perturbation Theory 


7.1 Generalities 


We can use instanton methods to obtain the size of the terms in large orders of 
perturbation theory. We will first consider particle quantum mechanics [21] and 
then generalize to quantum field theory. The general idea concerns actions of the 


form 


S(¢)= fe (5 ddot a6) (7.1) 


(9) 
o= f Doe~ 2 Tag" (7.2) 
we have from Cauchy’s formula 
1 L(g) 
Th=— od 7.3 
safu GE) (7.3) 


where the integral is over a contour containing the origin. For large k we want 
to perform this integral by Gaussian approximation about a critical point in ¢ 


and g. The critical point must satisfy the equations of motion, h = 1, 


_ S(¢,9) 


Oe hi k 1 
= = dag? A 
i ôg gk F g emi) ee ee 


and the usual equation for ¢ 


8,0" = Ta (7.5) 


EN- 1)! 
Changing the scale 
b+ (9) = (7.6) 
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gives 
k 1 -N 
7 ON) (mg) Nt [ža (7.7) 
which implies 
I k z +1 an. 
ON) Taryn Sloe Sag) A (7.8) 
which in turn means 
The other equation is simply 
Sp AT yp? NTI _2N—1 
eS SORE ores aioe (7.10) 
which should have a solution with 
fia <o. (7.11) 


We find such a critical point in various examples, and then perform the integrals 
by Gaussian approximation. 


7.2 Particle Mechanics 


In particle quantum mechanics we consider the Hamiltonian 


N 
1S zed 7 2 Z 2 
=e Sata (Soe (7.12) 
which describes n anharmonic oscillators which interact with each other. Then 


tr (e-Bb# HPE aa 
lim : n( Jaa ) = lim : In ( ite ) =E -— Ep, (7.13) 


tr (e~8Ho) e7 Po +... 


where E and Eo are the ground-state energy of the system and the corresponding 
free system. The ratio of the traces can be expressed as a path integral 
tr (e~8#) 
tr (e~8Ho) 


=N | Daren E r Gitaa) (7.14) 
periodic 
N is chosen so that the ratio is equal to 1 for g = 0. Periodic (T) converts 
the path integral into a trace. The term of order k is extracted via the Cauchy 
theorem. This integral corresponds to the integral in Equation (7.2). 
For particle quantum mechanics, however, we can actually perform the g 
integration exactly, 


=(= (7.15) 


= Ber aN 
1 e-s SE dr|a|?% y g dr |z| ) 
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thus we find 
tr (e~8#) 
tr (e— Ho) i 


; | 
-CDN f popje EGPA) emana), 716 


periodic 


To perform the path integral, we look for a critical point in 7, to the equation 


of motion E 
2N kg|gz|4- 
#=2 d l (7.17) 
So ar |@?% 
Changing the scale by > [Traraan® yields the equation 
0 TL 
f= z- gz., (7.18) 
The solution is easily found, 
T= Uxo (T — To) (7.19) 
with |u|? = 1 and the function xo given by, 
= N 
No ater 77 ar (7.20) 
cosh* ((N — 1)r) 
where xo(T) satisfies 
Xo = £9 — X(t)? N9. (7.21) 


This is most easily verified by observing 


de apices INN De - 
Hayat) 2 oe NO aN SE Ue en 


dr cosh®(N — 1)r 
(7.22) 
But q 
zo = 2(N —1)220-D-1 5, (7.23) 
and therefore d 
g2 = ~ 20 tanh(N — 1)r. (7.24) 
Finally 
do = —totanh(N — 1)r +.29(N — 1)sech?(N — 1)r 
= £9 tanh? (N — 1)r + zo(N — 1)sech?(N — 1)r 
= xo (1—sech?(N — 1)r — (N — 1)sech?(N — 1)r) 
N 
a 
cosh*(N — 1)r 
= zo — Tox) 0P (7.25) 


as required. Periodicity is satisfied if we begin at 7 = B and end at T= 2 As 
8 — œ the action is calculable, 


S [uxo (T — 7)| = Nk — Nkln (2Nk)+k(N —1)ln(J), (7.26) 
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where 


=f 5 2 er C5) i (7.27) 


It remains to calculate the determinant corresponding to the Gaussian 
fluctuations about this critical point. The operator coming from the second 
variation of the action is 


82 


d? N 
Za (71) 6% (T2) Pe 


sao (( dr? i cosh? ((N — Da) nee 
aN Neat Jat z 
cosh“ ((N — 1) rı) 


L 
aie 


2N 2 - 
+ Uatipxg” * (41) ag"? (72) 


= Myugug + Mr (dap —Uatig) (7.28) 


2 — 
m=( g +1 eet Jan T2) 4 ON 28-1 (74) 02%! (xy) 


dr? cosh? ((N —1)7) J 
(7.29) 
and 
d? N 
Mr = + 1 ô ; 7.30 
i ( dr? e) (m= 7) ( ) 


For the transverse operator Mr, the corresponding “quantum mechanical” 
Hamiltonian is 
2 A(A+1) 


M Epe a 
cosh“ (x) 


(7.31) 


which is exactly solvable. The eigenfunctions are the Jacobi functions. The ratio 
of the determinants is given by 


det (H — 2) r(1+v=z)r (V—2z) 


det(Ho— 2)  T(1+A+V=2z) 0 (V=z-))’ 


which is calculated using the Affleck Coleman method [31, 114, 36], where I is 
the usual gamma function. For the case at hand, À = wl. ( Wo which 


gives the transverse operator up to a factor of 1/(N —1)?. We must separate out 


(7.32) 


i 


the zero modes. These arise because of the invariance of the original Hamiltonian 
under global rotations of ¥ (equivalently of t). Rotations about a direction 
orthogonal to w@ should all be in the transverse operator. Thus the zero mode for 
Mr is simply x9 (rT). 

nob (wa (7) = (du) To (T) , (7.33) 


7.2 Particle Mechanics 115 


d det’ (H—z) 
det(Ho—z)? 
must divide the ratio by the smallest eigenvalue for not equal to its critical 


where (di) counts the number of independent rotations. To fin we 


value, and then take the limit. Now 


@ X(A+1) 


H-z= rR 7.34 
dx? cosh?(x) ee 
however, if we scale x > (N — 1)t, with A= wo ZS Woe we get 
1 d? N 
H-z= +1). 7.35 
(N —1)? ( dt? cosh?((N — 1)t) ) ae 
Then with z = — Wop? +e the zero mode becomes an eigenmode with 
eigenvalue e. Each eigenvalue of H — z is Wop? times the eigenvalue of 
£ ERTEN +1. Thus we must divide by e(N — 1)? to get det’. Hence 
Qn det(H—2) _ 2n(w +1) (xs) 
lim = (7.36) 


z>- det(Ho—z) 2(N—1 
>- (- wipe -2) (N-1)2det(Ho-2) 2(N-1) p(s) 


The 27 comes from the definition of the measure in the Gaussian integral. There 
are n — 1-independent transverse directions, for each one we have the same det’, 
to the power — 5, giving the total power — r, The Jacobian factor coming from 
changing the integration variable from the zero mode “Gaussian fluctuation” to 
the integration over the position gives a factor of ,/ [°° dra? (r) = (k(N +1)) 2, 
Thus the total contribution of the transverse modes is 


(2m)2 wt | x(w 41) 1? (5) 
gy NF) =n) F(a) 


The first factor is the volume from integrating over the directions of u. The 


(7.37) 


= 


longitudinal operator can also be treated in a similar fashion. With 


- d (2N —1)N 

Mr = +1 7.38 
2 qe cosh? ((N — 1}t) (a3) 
where |u)(u| projects on the mode 22’~'(r). There is one zero mode coming 

from time translation invariance, Soy It is orthogonal to x2! (r) 

dx (T) a we" Gy ae GI 
2N-1 oT) _ 0 _ 70 — 

J: dreg ~ (T) I- sf dr IN ~~ aN | 0 (7.39) 
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Then with |v) denoting eigenstates orthogonal to |u) 


det (Mz + |u) (u|) = det (Mz) det (1 + Mz |u) (ul) 


= det (Mz) det (: + (mu +> oei) vie) 


= det (Mz) det | 1+ (uJ Mg lulu) (ul +X- (ol Mz lu)lv) lul 
= det (Mz) (1+ (u|Mz"|u)) (7.40) 


where the final equality follows because the second determinant is of a matrix 
that is upper triangular. From the equation of motion 


ao 0 = f Al 
1-Ny° °° eal) 
Thus 
es JL dreh t Mpeg 
a EE 
T dt (xô ) 
1 © dre?™-tr 
= J- 4720 a (7.42) 
IN) par (23°) 
The integrals can be done exactly, giving 
e _ 
det (Mz) = ~——~ det (Mrz) (7.43) 


(N —1) 


Mz has a negative mode, which cancels the minus sign. This is not an instability, 
M, is just an auxilliary operator. Now we have an operator of the same form as 
the transverse part before 


@ A(A+1) 
A = 7.44 
oS de cosh? (x) a 
with à = %7; and z= Wop? Then the det’ is, 
2(_N 

t (H — iP ( = ) 
lim | a) - 2) =— rz (7.45) 

ogy Sto- 2) laa wy r (#5) 


The Jacobian from the usual change of variables in the integration is 


(FAE worm ew 


which then gives the total contribution 


(k(N —1))2, (7.47) 
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where the 8 comes from the integration over the position of the instanton. 
Finally, putting all the pieces together, we get the correction to the kth order in 
perturbation to the energy splitting 


1 n+1 r( 2N ) kK(N-1)+5 
Lip ahead 2 2((N=—1)) 2 k2 Wo 
E -— Eo =(-1)"""¢ (2) T (2) 24 r(x) 
x kev) (14019). (7.48) 


7.3 Generalization to Field Theory 


This result can be generalized to the case of quantum field theory; we leave this 
for the reader. We should make one point, though. Generally, we do not believe 
that the functional integral is an analytic function in an annulus around the 
point g =0 in the g-plane. Indeed, for g negative the Hamiltonian is not self- 
adjoint for sufficiently large N. We expect that in reality Z(g) is defined in the 
complex plane by analytic continuation, and this analytic function has a branch 
cut along the negative real axis which terminates at g =0. We must use the once 
subtracted dispersion relation 


(Oni Jo At g 2ri (g'—g)’ 
(7.49) 


which corresponds to the contour in Figure 7.1. If the second integral vanishes 


R 
7 EEE A E E eee E ee een) E E E 


as R —> œ we get 


ERS 1 ; Saot 
T(g)= S ara (discontinuity (Z(—A))). (7.50) 
For 
L(g) =) Trg" (7.51) 
k 
we have bye i 
k ; winced 
Tk = —(-1) a dT (discontinuity (Z(—A))). (7.52) 


The factor 1/A*+! becomes more and more singular at the end point À = 0; thus, 
if we know how the discontinuity of Z(—A) behaves for small A, we can find the 
behaviour of Zp for large k. For an expected asymptotic behaviour 


discontinuity (Z(—A)) ~ 2iBe7 * 7° aA! (7.53) 
l 


implies directly 


1 _4)k _ 7) g-(k+a-l1) 
T~——(-1) Per te 1) S- . (7.54) 
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Figure 7.1. Integration contour in the complex g plane for field theory 


The discontinuity of discontinuity (Z(—A)) can actually be computed using the 
semi-classical methods that we have been learning about. Collins and Soper [35] 
show that it has an expansion of exactly the form given in Equation (7.53). Thus 
the formal calculations that we have done, not worrying about the cut in the 
complex g plane, produce the same results with much less difficulty. 


7.4 Instantons and Quantum Spin Tunnelling 


We continue this chapter with an application to quantum spin tunnelling. This 
calculation starts out as an independent tunnelling calculation that, in principle, 
has nothing to do with large orders in perturbation theory. However, it turns 
out that the tunnelling calculations are all attainable through large orders in 
perturbation theory. We will have to understand what spin-coherent states are 
and the corresponding path integral. 


7.5 Spin-Coherent States and the Path Integral for Spin Systems 


For a spin system, instead of the orthogonal position |x) and momentum |p) 
basis we define a basis of spin-coherent states [106, 100, 75, 87]. Let |s,s) be 
the highest weight vector in a particular representation of the rotation group. 
This state is taken to be an eigenstate of the operators S. and $ , two mutually 
commuting operators: 


S.|s,s) =s|s,s) $?|s,s)=s(s+1)|s,s). (7.55) 
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Figure 7.2. The directions of the unit vectors Z and n on a two-sphere 


The spin operators S;, i = x,y,z satisfy the Lie algebra of SU(2), 
[S;, S3] = icije Sr (7.56) 


where ¢€;; is the totally antisymmetric tensor symbol and summation over 
repeated indices is implied in Equation (7.56). 
The coherent state is defined as [100, 87, 75, 127, 49, 48] 


s 
ja) = lS s,s) = S> M )malsm), (7.57) 
m=—s 
where |ñ) is an element of the 2s + 1-dimensional Hilbert (representation) space 
for the spin states, ù = (cosdsin§, sin dsin 8, cos0) is a unit vector, i.e. A? =1 and 
m= (fi x 2)/[ù x 2| is a unit vector orthogonal to ù and £. Z is the quantization 
axis pointing from the origin to the north pole of a unit sphere and Ĥ -2 = cosé 
as shown in Figure 7.2. Rotating the unit vector ñ counterclockwise about the m 
direction by the angle @ brings it exactly to the unit vector Z. |) corresponds to 
a rotation of an eigenstate of S., i.e |s,s), to an eigenstate with a quantization 
axis along ù on a two-dimensional sphere S? = SU(2)/U(1). It turns out that 
the matrices M*(f) satisfy a non-trivial relation 


ME (A1) M° (fig) = M° (fg )Jet M1 Fah) Se (7.58) 
where G (1, M2, 73) is the area of a spherical triangle with vertices f1, 2,3. Note 
that Equation (7.58) is not a group multiplication, thus the matrices M€ (ñ) do 
not form a group representation and G(fi1,/2,73) is called a co-cycle, which 
represents the obstruction that the matrices M° (ñ) exhibit to forming a true 
representation of the rotation group. 

Unlike normal position and momentum eigenstates, the inner product of two 
coherent states is not orthogonal: 


(1+: A) (7.59) 
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It has the following property: 
i- SIA) =s|A) > (AI A) = så (7.60) 
while the resolution of identity is given by 


EL nsa —1)|A) (Al, (7.61) 


At 


f= 


where Î is a (2s+1) x (2s +1) identity matrix, and the delta function ensures 
that ù? = 1. Using the expression in Equations (7.59) and (7.61) we can express 
the imaginary time transition amplitude between |f;) and |i) as a path integral, 
which for the spin system is given by [48, 127] 


(ple PEO h) = J REA, (7.62) 


where 


Spl] = isSwz + / drU(a(r)), U(A(r)) = (Al AA) (7.63) 


and Swz arises because of the additional phase e*°9(-"".2) in Equation (7.59). 
We have set A= 1 in the path integral. 

The Wess—Zumino (WZ) action, Swz in the coordinate independent formal- 
ism, is given by! [97, 122, 120, 49, 48] 


Swz= f 7 drdéfi(7,€) - [O,f(7,€) x Oehi(7,€)], (7.64) 


where /i(7) has been extended over a topological half-sphere iS ? in the variables 
T,€. We call this the coordinate independent expression since no system of 
coordinates is specified for the unit vector n. In the topological halfsphere we 
define with the boundary conditions 


n(7,0) =A(r), A(7,1) =2 (7.65) 


so that the original configuration lies at the equator and the point € = 1 
is topologically compactified by the boundary condition. This can be easily 
obtained by imagining that the original closed loop (T) at € = 0 is simply pushed 
up along the meridians to n(7) = 2 at £= 1. The WZ term originates from the 
non-orthogonality of spin-coherent states in Equation (7.59). Geometrically, it 
defines the area of the closed loop on the spin space, defined by the nominally 
periodic, original configuration (T). It is crucial to note that there is an 
ambiguity of modulo 47, since different ways of pushing the original configuration 
up to the north pole can give different values for the area enclosed by the closed 
loop as we can imagine that the closed loop englobes the whole two sphere any 
integer number of times, but this ambiguity has no physical significance since 


1 An alternative way of deriving this equation can be found in [16]. 
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e'N4™s — 1 for integer and half-odd integer s. The action, Equation (7.63), is 
valid for a semi-classical spin system whose phase space is S?. It is the starting 
point for studying macroscopic quantum spin tunnelling between the minima of 
the energy U (ñ). 


7.6 Coordinate-Independent Formalism 


In the coordinate independent formalism, the spin is represented by a unit vector 
û(T) but no parametrization of the unit vector is assumed. It is best to exemplify 
the coordinate independent analysis through an explicit system. 

We will study the simplest biaxial single-molecule magnet whose spin 
Hamiltonian is given by 


H=-K,82+K,S?, K,>K,>0. (7.66) 


The above Hamiltonian possesses an easy-axis in the z-direction and a hard-axis 
in the y-direction. When Ky = 0, the spin is localized along the z-axis, which 
is usually parameterized by the variable 6 = 0,7 and possesses two degenerate 
minima localized at the north and the south poles of the two-sphere of phase 
space. Addition of small kK, #0 introduces dynamics into the system and causes 
tunnelling. The real tunnelling variable is expected to be @ in the easy-axis 
direction. 

The Hamiltonian defined above has been studied in the presence of a magnetic 
field by many authors [28, 27, 53]. However, the quantum-phase interference for 
this model has not been computed, due to the subtlety involved in computing the 
action for the instanton. Since the relation $? = 8? F s F 82 = s(s +1) holds for 
any spin system, it is evident that any biaxial single-molecule magnet is related 
to any other either by rescaling the anisotropy constant or by rotation of axes. 
For instance, the Hamiltonian studied by Enz and Schilling [43]: 


H =—AS?+BS?, (h=0), (7.67) 


possesses an easy x-axis and a hard z-axis. This model in the conven- 
tional spherical parametrization in terms of the phase space variables, n = 
(sin@cos¢,sin@sing,cos@) is exactly our Hamiltonian Equation (2.15) in the 
unconventional spherical parametrization ù = (sinô sinġ,cos0,sinĝ cos) with 
K,=Aand K, =B. 


7.6.1 Coordinate-Dependent Analysis 


To demonstrate the technique for investigating the quantum-phase interference 
in the z-easy-axis model, we will first keep to the conventional, coordinate- 
dependent spherical parametrization, ù = (sin cosġ,sinĝsinġ,cos0). It was 
shown [52] that perturbation theory in the Ky term for integer spin leads 
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to an energy-splitting proportional to (K,)* while for half-odd integer spin, 
the splitting vanishes in accordance with Kramers’ theorem. We will recover 
this result using the spin-coherent state path integral, and we will explicitly 
demonstrate in all detail that the result can be obtained without recourse to a 
coordinate-dependent parametrization. 

The transition amplitude in the spin-coherent state path integral, in the 
coordinate-dependent formalism, is given by [75] 


(Or bjle-|6:,4%) = J D [cos] D [ø] e752" (7.68) 
where the Euclidean action is 
TR f dr fisd(1—cos6) + E0, 4) (7.69) 


where the first term is the WZ term in the coordinate dependent formalism and 
the classical anisotropy energy Equation (2.15) is 


E(0,¢) = (K; + Kysin? ¢) sin” 0. (7.70) 


We note that the WZ term, being first-order in time derivatives, remains 
imaginary upon analytic continuation to Euclidean time. This has important 
ramifications for the putative instanton solutions: they too must have non-trivial 
imaginary parts. The classical degenerate ground states correspond to ¢ = 0, 
0 =0,7, that is the spin is pointing in the north or south pole of the two-sphere. 
The classical equations of motion obtained by varying the action with respect to 
0 and @¢, respectively, are 


isosind = er ae (7.71) 
isO sind = Eeo, (7.72) 


It is evident from these two equations, because of the explicit i, that one variable 
has to be imaginary for the equations to be consistent. The only appropriate 
choice is to take real 0 and imaginary ¢, since the real tunnelling coordinate 
(z-easy-axis) is 0. This comes out naturally from the conservation of energy, 
which follows by multiplying Equation (7.72) by ġ and Equation (7.71) by 6 and 
subtracting the two: 


dE (0 
ae ee) =0 ie, E(6,¢)=const. =0, (7.73) 
= 
the normalization coming from the value at 0 = 0. Thus, 
E(6,¢) = (K; + Kysin? ¢) sin? 0 =0. (7.74) 


Since sin? @ 40, as it must vary from 0 to 7, it follows that, 


sing = +2, | = (7.75) 


7.6 Coordinate-Independent Formalism 123 
therefore @ is imaginary and constant. Let ¢ = or + id, then sing = 


sing@rcosh¢; + icosdrsinh¢d;. We must take ør = nr so that sindr = 0 as 
the right-hand side of Equation (7.75) is imaginary. Hence 


(—1)”" sinh dy = Le ea (7.76) 


as cosérz = (—1)”. There are four solutions of this equation: n = 0, 6 = ir and 
n=1, 6=7-—i¢, for the positive sign and n=0, = —iġr and n=1, d6=1+i¢7 
for the negative sign, where ¢; is the same in both cases. Taking into account 


that K,>> Ky >0, we have ġr = aresinh ( Ke) es 1 In (442) 


The classical equation of motion (7.72) simplifies to 


ô : eel Ge 
isg ysin2¢ = iKysinh 2¢ġr (7.77) 


The solution is easily found as 
0 (T) = 2arctan[exp(w(T — To))], (7.78) 


where w = Ky sinh2¢;. This corresponds to the tunnelling of the state |f) from 
0(T)=0 at T= —o to the state |4}, 0(7) =m at T= oo. The two solutions 
o= iġr and ¢=7+i¢, in the upper-half plane of complex ¢ correspond to the 
instanton, (6 > 0) while the solutions ¢ = —i¢; and ¢=7 —i¢, in the lower-half 
plane of complex ¢ correspond to the anti-instanton, (Ê < 0). 

Since the energy, F'(@,¢), in the action Equation (7.69) is conserved and 
therefore always remains zero along this trajectory, the action for this path is 
determined only by the WZ term which is given by 


oo 
Sp = Swz = is f dr ọ(1— cos) = is {i do(1—cos@). (7.79) 
—oo $i 
Naively, one can use the fact that ¢ is constant and hence d= 0, which seems 
to give Swz = 0; however, care must be taken when computing the action. A 
non-zero Euclidean action is found by realizing, as in [99], that we must take into 
account the fact that ¢@ must be translated from ¢=0 to ġ = nr + ir before 
the instanton can occur and then back to ¢=0 after the instanton has occurred. 
Since the action is linear in time derivative of ¢, the actual path taken does 
not matter, only the boundary values matter. In the present problem, we have 
two solutions for ¢, i.e. 6=i¢; and 6=7 +i; corresponding to two instanton 
paths, call them J and JI. The full action is then 
n+iby 0 
SE =is | dé(1—cos8)|g-9 tis | do(1— cos0)lo=r 
0 T+idy 
= —2ris+ 28g; (7.80) 
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and 
ib] 
sH =is | do(1— cos0)|o=0 (7.81) 
0 


0 
+is do(1— cos) |92n = 28¢7, 
ibr 
where it is clear that the total derivative term dọ contributes nothing as the two 
contributions cancel in the round trip, while the décos@ gives all the answer, 
since cos@ = 1 initially, before the instanton has occurred, while cos@ = —1 after. 
The action for the corresponding anti-instantons is identical. The amplitude for 
the transition from 0 = 0 to 6=7, as usual, is calculated by summing over a 
sequence of one instanton followed by an anti-instanton with an odd total number 
of instantons and anti-instantons [31], but we must add the two exponentials of 
the actions S and SẸ for both instanton and anti-instanton. We note 


eSB 4 SH = e~ 281 (1+¢?7"*) =e" 751 (1+ cos2rs), (7.82) 


where the last factor vanishes for half-odd integer spin. Then we get that the 
expression for the amplitude is given by 


(w|e~8# |0) = sinh («8 (1 + cos(2ms))e~ 2°?!) (7.83) 


where « is the properly normalized square root of the determinant of the operator 
governing the second-order fluctuations without the zero mode, which we have 
not computed, and M is the usual normalization factor. The energy splitting can 
be read off from this expression 


AE = «(1+ cos(2ms))e~ 2°. (7.84) 


For half-odd integer spin the splitting vanishes, while for integer spin we have 


KN 
AE =4k ( ee ) (7.85) 


which agrees with the result found by perturbation theory [52]. 


7.6.2 Coordinate-Independent Analysis 


Now we wish to see that the spherical-polar coordinate-dependent parametriza- 
tion of the unit vector ù is not at all necessary. Then the action for the 
Hamiltonian in Equation (7.66) can be written as 


Sr = fares = far [-K.(fi- 2)? + K,(fi-§)7] 


+is | drdê [ñ (O,f x Oeh)]. (7.86) 
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The first term is the anisotropy energy while the second term is the WZ term 
written in a coordinate-independent form. The WZ term is integrated over a two 
manifold whose boundary is physical, Euclidean time 7. Thus the configuration 
in 7 is extended into a second dimension with coordinate €. The equations of 
motion arise from variation with respect to ñ. However, 7 is a unit vector, hence 
its variation is not arbitrary, indeed, -ñ = 0. Thus, to obtain the equations of 
motion, we vary 7 as if it is not constrained, but then we must project on to the 
transverse part of the variation: 


Taking the cross-product of the resulting equation one more time with ñ does 
no harm, and this process yields the equations of motion 


isO,ft—2K.(f- 2)(Ax 2) +2K,(fi-9)(A x §) =0. (7.88) 


Taking the cross-product of this equation with 0,7, the first term vanishes as 
the vectors are parallel, yielding 


—2K,(f-2)0-n x (Ax 2) + 2K, (A. 9)O-nx (A x GJ) =O. (7.89) 


Simplifying the triple vector product, using 0,/-7% = 0, and then taking the scalar 
product of the subsequent equation with ù gives 


0, (—K,(fi- 2)? + K,(A-§)”) =0, (7.90) 


which is the conservation of energy. The initial value of ù = 2 says that the energy 
must equal 
(—K.(f- 2)? + Ky(f-§)?) = -K (7.91) 


From this equation and because fi is a unit vector we find 


Kirpa . Kz A 
agaa- 1) -+j a- (A+ 2)2) 
K+K, te 
eee (a-2)?), (7.92) 
Ky 
where the + signs are not correlated. Then 
d- . K, 
a | SS 8 ; 7.93 
a Nae Li 


hence, we recover the result immediately that is a complex constant, just as 
before. Taking the scalar product of Equation (7.88) with 2 yields 


isO,(A-2)+2K,(A-§)(A-#) =0 (7.94) 
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and replacing from Equation (7.92) gives 


is, (ñ 2) +2i,/K,(Ky + Kz)(1— (ñ: 2)?) =0 (7.95) 


Notice that the z’s neatly cancel leaving a trivial, real differential equation for 
n- 2, which we can write as 


O,(A-2) Or 


oe) 4 
1—(A-2) ` Te Gey tg VEK +K) (7.96) 


This integrates as 


1 A- 4 
nsi K,(Ky + Kz)(1 — 70). (7.97) 


Exponentiating and solving for -ê gives 


A-Z=+tanh (= K.(K,+K;)(r-7)) s (7.98) 


which is exactly the same as the solution found for 0 in Equation (7.78). The 


instanton (upper sign) interpolates from n, = 1 to n; =—1 as T > +00. 

Thus it is important to know that the equations of motion can be solved 
without recourse to a specific choice for the coordinates. We will now evaluate 
the tunnelling amplitude and the quantum interference directly in terms of the 
coordinate-independent variables. Since the energy remains constant along the 
instanton trajectory, the action is determined entirely from the WZ term 


1 
Swz = is farf dé [â : (O-n x Ozh) ] : (7.99) 
0 
The integration over € can be done explicitly by writing the unit vector as 


17,6) = F(7,8)mz(7)2 + 9(7,§) [na (7) + ny (7g (7.100) 


with the boundary conditions À (T, =0) = A(T) and fA(7,€ =1) = 2, where we 
write n, for A(r)- 2%, etc. Using the expression in Equations (7.100) and the 
condition that ñ -ñ = 1, one obtains 
Ta 2m2 
r (7.101) 


2 
l-n? 


These functions obey the boundary conditions 


f(r.§=0) =1.f(ng= =, 
g(7,€ =0) =1,9(7,€ =1) =0 (7.102) 


The integrand of Equation (7.99) can now be written in terms of the 
functions defined in Equation (7.100). After a straightforward, but rather tedious, 
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calculation we obtain 


A: (Op x Oeh) =n2(9? F’ — fgg’) (Naty — Nye) 


, 


my, 
l-n; 


where f’ = ðe f, fisy =O;Nzx,y. The second equality follows from Equation (7.101). 
Replacing Equation (7.103) into the WZ term, the € integration in 
Equation (7.99) can be done explicitly which yields 
i Nyy — Nyy 
Swz Sis f dr Det ahe), (7.104) 

This expression defines the WZ term in the coordinate-independent form as a 
function of time alone. We can always make recourse to any specific coordinates, 
taking the z-easy-axis system, with the spherical parameterization we recover the 
usual form of the WZ term in condensed matter physics, i.e. Equation (7.79). 
Multiplying the top and the bottom of the integrand in Equation (7.104) by 
(1—n,), the resulting integrand simplifies to 

Uny/Ne) a 
1+ (ny/ne)? 


= is | dfarctan(ny/n,)](1~n.) 
=is | d-n), (7.105) 
which is rather analogous to the coordinate-dependent expression in 


Equation (7.79). 
It was already noted from Equation (7.93) that ¢ has to be imaginary. 


Swz=is Nz) 


To recover the quantum-phase interference in the coordinate-independent 
formalism, ¢ must be translated from the initial point, say ¢ = 0, to the final 
point, o =nr +iġr, n=0,1 before and after the instanton occurs [99]. The two 
contributions to the action from these paths are given by 


T+iby 0 
Siz ais / doi =a E E =n E 
0 T+idy 


= —2ris +2sġr (7.106) 


and 
ir 
Stig =is | d6- n) (7.107) 
0 


0 
tis | do- na)lns=1 = 2501 
ibr 


which are exactly the expressions as before. Then the previous evaluation 
quantum interference goes through unchanged. 
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7.7 Instantons in the Spin Exchange Model 


We will study a second example where instantons give rise to quantum tunnelling 
in spin systems and breaks the degeneracy, the case of two large, coupled, 
quantum spins in the presence of a large, simple, easy-axis anisotropy, interacting 
with each other through a standard spin-spin exchange coupling, corresponding 
to the Hamiltonian 


H =—K(S?,+82,) +51 - So. (7.108) 


We will take K > 0 and the case of equal spins S= =S. A> gives an anti- 
ferromagnetic coupling while A < 0 sign corresponds to ferromagnetic coupling. 
The first term gives rise to the anisotropy, favouring an easy-axis, the z-axis, 
the first term’s contribution to the energy is obviously minimized if the spin 
is pointing along the z-direction and is as large as possible. The second term is 
called the Heisenberg exchange energy interaction. The spins S; could correspond 
to quantum spins of macroscopic multi-atomic molecules [113, 116, 90], or the 
quantum spins of macroscopic ferromagnetic grains [28, 27], or the average spin 
of each of the two staggered Neél sub-lattices in a quantum anti-ferromagnet 
[116, 91, 92]. 

A Néel lattice is simply a spin system where adjacent spins are maximal and 
point in opposite directions. It is the epitome of anti-ferromagnetic order. We 
will be exclusively looking at one dimension, thus what are called spin chains. 
As the spins on a lattice are distinguishable, one choice starting at a given spin 
of up, down, up, down, --- is a different configuration from down, up, down, up, 

--, starting from the same spin. This twofold degeneracy is akin to the two- 
fold degeneracy of a ferromagnetic system, where all spins could point up or all 
spins could point down. The direction of the up and down is determined by the 
anisotropy, which picks out a favoured direction for the spins. In this section, we 
will only consider two spins, but in the next section we will generalize our results 
to a spin chain. 

The non-interacting system of our Hamiltonian is defined by A = 0, here the 
spin eigenstates of $;,, notationally |s,s,,) Q |s, S22) = |$12,82,), are obviously 
exact eigenstates. The ground state is fourfold degenerate, corresponding to the 
states |s,s), |—s,—s), |s,—s) and |— s,s), which we will write as fM, t), N, 4}, [tL 
), Kt), each with energy E = —2Ks?. The first excited state, which is eightfold 
degenerate, is split from the ground state by energy AE = K(2s—1). 

In the weak coupling limit, A/K — 0, an interesting question to ask is what is 
the ground state and the first few excited states of the system for large spin 5. 
For spin 1/2, the exact eigenstates are trivially found; for spin 1, the problem 
is a 9x 9 matrix, which again can be diagonalized, but for the general case we 
must diagonalize a (2s + 1)? x (2s +1)? matrix, although that is rather sparse. 
For weak coupling the anisotropic potential continues to align or anti-align the 
spins along the z-axis in the ground state. 
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As the non-interacting ground state is fourfold degenerate, in first-order 
degenerate perturbation theory, we should diagonalize the exchange interaction 
in the degenerate subspace. However, it turns out to be already diagonal in that 
subspace. The full Hamiltonian can be alternatively written as 


1 
H=—K(S?,+82,)4 A (51-52 + 515783 +57S})), (7.109) 


where S7 = Siz E iSiy for i = 1,2. S* act as raising and lowering operators 
for Siz, and hence they must annihilate the states |f,1), |4, 4}. Thus the two 
states |t,t),|1,1) are actually exact eigenstates of the full Hamiltonian with 
exact energy eigenvalue (—2K + \)s?. These two states do not mix with the two 
states |f, 4), |}, f) as the eigenvalue of S17 + S22, which is conserved, is +2s, —2s 
for the two ferromagnetic states and 0 for the two anti-ferromagnetic states. The 
perturbation, apart from the diagonal term AS12927, acting on the two states |f, | 
), i,t) takes them out of the degenerate subspace, thus this part does not give any 
correction to the energy. The action of the diagonal term on either of these states 
is equal to —As?. Thus the perturbation corresponds to the identity matrix within 
the degenerate subspace of the two states |f, 1), L, 4), with eigenvalue — As? for the 
two anti-ferromagnetic states. This yields, in first-order degenerate perturbation 
theory, the perturbed energy eigenvalue of (—2K — \)s? for the two states |f 
+), 4). Thus the following picture emerges of the first four levels in first- 
order degenerate perturbation theory. For the A < 0 (ferromagnetic coupling), 
the states |t,t),\.,) are the exact, degenerate ground states of the theory, with 
energy eigenvalue (—2K +A)s? = (—2K —|A])s?. The first excited states are also 
degenerate, but only within first-order degenerate perturbation theory. They are 
given by |t,J),\,¢), with energy eigenvalue (—2K — \)s? = (—2K +|A|)s?. For 
the à > 0 (anti-ferromagnetic coupling), the roles are exactly reversed. The states 
tt, L), I.) give the degenerate ground state with energy (—2K — A)s? in first-order 
degenerate perturbation, while the states [t,t),|,1) give the exact, first (doubly 
degenerate) excited level with energy (—2K +.)s?. Thus the Hamiltonian in 
first-order degenerate perturbation theory is simply diagonal 


-2K +À 0 0 0 
0 -2K +2 0 0 ; 
is 11 
(H) 0 0 —2K—\ 0 $ rto) 
0 0 0 ey est 


in the ordered basis {|t,t),\,1), It), i,t) }. The two ferromagnetic states are 
the exact degenerate ground states for A < 0, while the two anti-ferromagnetic 
states are the approximate ground states for \ > 0. 

However, we do not expect this result to stand in higher orders. We will 


show that, in fact, the states |+) = Well It, ) HT) are the appropriate linear 
combinations implied by higher orders in degenerate perturbation theory, for the 
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ground state in the anti-ferromagnetic case, and they are the second and third 


excited states in the ferromagnetic case. We will also show that the states |+) 


are no longer degenerate. The perturbing Hamiltonian links the state | + s, Fs) 


only to the state | +s F 1,Fs +1). To reach the state | F s, +s) from the state 


| +s, Fs) requires one to go to 2sth order in perturbation, and s is assumed to 


be large. Indeed, we find our results via macroscopic quantum tunnelling using 
the spin-coherent state path integral. Using the path integral to determine large 
orders in perturbation theory has already been studied in field theory [35, 128]. 

Our two-spin system, in Minkowski time, is governed by an action S = f dtL 
where, 


L= fesi i (3zi x atha) = Vi (fi1) 
+ [desig j (Ozhe x atd) — Va (2) — ANY “fo, (7.111) 


where now fi; = (sin 8; cos ¢;, sin 8; sin d;,cos6;) ,i = 1,2 are two different 3-vectors 
of unit norm, representing semi-classically the quantum spin [28, 27] and s is the 
value of each spin. We use the coordinate-dependent spherical-polar coordinate 
to describe the spins and the Lagrangian takes the form 
L= —s¢ġı (1 — cos01) — Vi (01,61) 
sġa(1— cos02) — V2(02, 2) 
— à (sin 01 sin 62 cos(¢1 — $2) + cos 6; cos 82) . (7.112) 


Our analysis is valid if we restrict our attention to any external potential with 
easy-axis, azimuthal symmetry, with a reflection symmetry (along the azimuthal 
axis), as in [68], Vi(0:,¢:) = V (0:1) = V(r — 9;),i = 1,2. The potential is further 
assumed to have a minimum at the north pole and the south pole, at 6; = 0, and 
m. We will treat the special simple case of the potential given by 


V (Ai) = V (0i, Qi) = K sin? 0;. (7.113) 


corresponding exactly to our Hamiltonian Equation (7.108). It was shown in 
[68], for uncoupled spins, that quantum tunnelling between the spin up and 
down states of each spin separately is actually absent because of conservation 
of the z-component of each spin. With the exchange interaction, only the 
total z-component is conserved, allowing transitions |f, 4} <> |{,¢). In general, 
tunnelling exists if there is an equipotential path that links the beginning and end 
points. We will see that such an equipotential path exists, but through complex 
values of the phase space variables. 

We must find the critical points of the Euclidean action with t > —i7, which 
gives 


Lpg = ish, (1 —cos6,) +V (01) + isd2(1 — cos62) + V (02) 
+ à (sin 6; sin 62 cos(¢1 — ¢2) + cos 01 cos 62). (7.114) 
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The solutions must start at (01,1) = (0,0) and (62, ¢2) = (m,0), say, and evolve 
to (61,¢1) = (7,0) and (02,¢2) = (0,0). In Euclidean time, the WZ term has 
become imaginary and the equations of motion in general only have solutions 
for complexified field configurations. Varying with respect to ¢; gives equations 
that correspond to the conservation of angular momentum: 


ise (1 —cos@)) + Asin} sin 62 sin (¢1 — ¢2) =0 (7.115) 

T 

i£ (1 — cos @2) — Asin 6} sin 02 sin (¢1 — $2) = 0 (7.116) 
T 


Varying with respect to 6; gives the equations: 


iso, sin 6; + 2K sin; cos; + A (cos 41 sin 62 cos(¢1 — $2) — sin 6; cos 2) = 0 
(7.117) 
isos sin 62 + 2K sin 02 cos 62 + À (cos 62 sin 6; cos(¢1 — $2) — sin 82 cos 01) = 0. 
(7.118) 
Adding Equations (7.115) and (7.116) we simply get 
a 6; + cos62) =0 (7.119) 
Gp (00881 + 0882) = 0. . 


Hence cos0ı + cos =l = 0, where the constant l is chosen to be zero using the 
initial condition 0; = 0,02 = m and therefore we can take 02 = m — 01. We can now 
eliminate 62 from the equations of motion, and writing 0 = 01, ¢= ¢, — ¢2 and 
® = ¢, + ¢2 we get the effective Lagrangian: 


L = is —isdcosd +U(6,¢), (7.120) 


where U (6,¢) = 2K sin?@+ à (sin? 0 cos ¢ — cos? 0) + is the effective potential 
energy. We have added a constant À so that the potential is normalized to zero 
at 0 =0. The first term in the Lagrangian is a total derivative and drops out. 
The equations of motion become: 


isosind = a (7.121) 
is sind = a (7.122) 


These equations have no solutions on the space of real functions 0(7),¢(7) due 
to the explicit i on the left-hand side. The analogue of conservation of energy 
follows immediately from these equations, multiplying (7.121) by 6 and (7.122) 
by ġ and subtracting, gives: 

dU (8,9) 


S os 0, i.e. U(0,b) =const. =0. (7.123) 
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The constant has been set to 0 again using the initial condition 0 = 0. Thus we 
have, specializing to our case, Equation (7.113) 

U (0,¢) = (2K +A (cos + 1)) sin? 6 =0 (7.124) 


implying (2K + (cos¢+1)) =0, since sin? 6 ¥ 0, as is required for a non-trivial 
solution. Thus 


À 


and we see that ¢@ must be a constant. This is not valid in general, it is due 
to the specific choice of the external potential Equation (7.113). Since K > |A| 


gosg = — (+1) (7.125) 


we get |cos¢| > 1, which of course has no solution for real ¢. We take ¢ = 
Or +iġr which gives cosd = coségcosh¢,; —isindrsinhd¢;. As the right-hand 
side of Equation (7.125) is real, we must have either ¢; =0 or dr = nz or both. 
Clearly the ¢; =0 cannot yield a solution for Equation (7.125), hence we must 
take r = na. As we must impose 27 periodicity on dr, only n =0 or 1 exist. 
Then we get 


EE A -(?K +1) ifA>0 
cos Ọ = cosh Ọr = +(28-1) ALO’ 


(7.126) 


Thus n = 1 for À > 0 and n=0 for À < 0 allowing for the unified expression 


2K +A 
cosh ġr = a (7.127) 
Equation (7.122) simplifies to 
Lore ta —Asing = —iA(—1)” sinh ¢; = iļà| sinh dy (7.128) 
in 


as A(—1)” = — |A|. Equation (7.127) has two solutions: positive øz corresponds to 
the instanton, (0 > 0), and negative r corresponds to the anti-instanton, (0 < 0). 
The equation is trivially integrated with solution 


6(r) =2arctan (Couey (7.129) 


where w = (|A|/s) sinh ¢; and at T = To we have 0(7) = 7/2, which has exactly the 
same form as the solution in the previous section, Equation (7.78). Thus 6(r) 
interpolates from 0 to m as T interpolates from —oo to oo for an instanton and 
from 7 to 0 for an anti-instanton. 

Using ¢ = 0 and Equation (7.123) that the effective energy is zero, we see 
that the action for this instanton trajectory, let us call it So, simply vanishes 
So= JS, dr£=0. So where does the amplitude come from? As in the previous 
section, we have not taken into account the fact that ¢ must be translated from 
¢=0 (any initial point will do, as long as it is consistently used to compute the 
full amplitude) to ¢=nz+i¢, before the instanton can occur and then back to 
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¢=0 after the instanton has occurred. Normally such a translation has no effect; 
either the contribution at the beginning cancels that at the end or, if the action 
is second-order in time derivative, moving adiabatically gives no contribution. 
But in the present case, for an instanton, before the instanton occurs, 0 = 0, 
while after it has occurred, 0 = 7, and vice versa for an anti-instanton. As $ is 
multiplied by cos@ in the action, the two contributions actually add, and there 
is a net contribution to the action. Indeed, the full action for the combination of 
the instanton and the changes in ¢ is given by 


nt+iby 0 
AS = i) —isddcos0|g=0 + So +f —isdocos0|o=r 
0 nn+ipr 
= —is2nr + 2s¢7 (7.130) 


we call it AS since it arises because of a change in ¢, and where we have put 
So = 0. 

We will use this information to compute the following matrix element, using 
the spin-coherent states |0,¢) and the two lowest energy eigenstates |Fo) and 
|E1): 


(05, osle P? lOi pi) = e797 (0r, ppl Eo) (Eo|9:, di) 


+e (0; p| E Elh p t (7.131) 
On the other hand, the matrix element is given by the spin-coherent state path 
integral 
Osos 
(Or, pple PF |0:,0:) =N DoDo e™®°E, (7.132) 
0i Qi 


The integration is done in the saddle point approximation. With (6;,¢;) = (0,0) 
corresponding to the state [t,|) and (0f, f) = (7,0) corresponding to the state 


| 4,1), we get 
(Lt lett) =Ne** xB +---), (7.133) 


where « is the ratio of the square root of the determinant of the operator 
governing the second-order fluctuations about the instanton excluding the time 
translation zero mode, and that of the free determinant. It can, in principle, be 
calculated, but we will not do this here. The zero mode is taken into account by 
integrating over the position of the occurrence of the instanton giving rise to the 
factor of 8. N is the overall normalization including the square root of the free 
determinant which is given by Ne~"0°, where Eo is the unperturbed ground- 
state energy and N is a constant that depends on the form of the perturbative 
ground-state wave function. The result exponentiates, but since we must sum 
over all sequences of one instanton followed by any number of anti-instanton— 
instanton pairs, the total number of instantons and anti-instantons is odd, and 
we get 

e~ “55,8 — sinh (e~ “5x 8). (7.134) 
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Given AS = —is2na + 2s¢,; and solving Equation (7.127) for ¢; for K > |A| 


2K 4K 
$r = arccosh (==) xln (=) (7.135) 


gives: 


Wl) ifseZ 
gistn—2edr if \>0= (Ht) F 
ASL (4) if s€Z+1/2. (7.136) 


(py if\<0 


Then we get 


(tle EN (e rB re Ga) =) Ne~®®, (7.137) 


where the — sign only applies for the case of anti-ferromagnetic coupling with 
half odd integer spin, i.e. A > 0,s = Z + 1/2. An essentially identical analysis 
yields, for the persistence amplitudes 


(tle PL treat) le Pelt) 
a (Se) con se) o Ne7B2o, (7.138) 


These calculated matrix elements can now be compared with what is expected 
for the exact theory: 


(Lot len PF tJ) =e PFo- 34) (|, + Bo) (Eolt 4) 
4e7B(Eo+ 24) (Lt | £1) (Exit, J) 


(7.139) 
and 
(Lt le Fh, t) = eiA], + |B) (Boll, t) 
+e A(Bo+ BAF) (1+ mEt) 
(7.140) 
The energy splitting can be read off from this result 
AE = E — Ep =2 Gi g (7.141) 
SL Ea R . 


for all cases; however, the wave functions are different. The low-energy eigenstates 
are given by 


| Eo) = z% ‘PHH lE) = ZU st) - th) (7.142) 
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for the case À > 0 for s € Z, where they are the actual ground and first excited 
state as well as for the case À < 0 (although here these energy eigenstates should 
be labelled |E} and |£4) as the actual ground states are the ferromagnetic 
states [t,t),\.,1)). For the fermionic spin, anti-ferromagnetic case with À > 0 and 
s€Z+4+1/2 we get the reversal of the states 


1 


v2 


but the energy splitting remains the same. 


1 


|Eo) = z 


P-E E) = a(t) +), (7.143) 


This understanding of the ground state in the anti-ferromagnetic case is the 
main result. This difference in the ground states for integer and half-odd integer 
spins is understood in terms of the Berry phase [88, 38] (computed by the change 
in the WZ term) for the evolution corresponding to the instanton. It can also 
be understood by looking at perturbation theory to order 2s; the details cannot 
be given here. Briefly, the action of the perturbation Equation (7.109) will lower 
one spin and raise the other. This can be done 2s times when we achieve a 
complete flip of both spins. We find that the effective 2 x 2 Hamiltonian for the 
degenerate subspace is proportional to the identity plus off-diagonal terms that 
are symmetric. For the integer spin case the off-diagonal terms are negative and 
for the half-odd integer case they are positive. Diagonalizing this 2 x 2 matrix 
gives the solutions for the ground states, exactly as we have found. 


7.8 The Haldane-like Spin Chain and Instantons 


The study of quantum spin chains has been a very important physical problem in 
condensed matter and mathematical physics over the past 100 years. They play 
an exemplary role in the study of strongly correlated quantum systems. In both 
experimental and theoretical physics, models of quantum spin chains are one 
of the most fundamental systems endowed with interesting phenomenon. The 
classic work on spin chains was that of Bethe [14] and Hulthén [63] for the one- 
dimensional (D = 1), isotropic Heisenberg spin-4 anti-ferromagnetic chain. They 
computed the exact anti-ferromagnetic ground state and its energy for an infinite 
chain. Anderson [6] worked out the ground-state energies and the spectrum for 
D = 1,2,3 by means of spin wave theory. The inclusion of an anisotropy term 
introduces much interesting physics ranging from quantum computing [90] to 
optical physics [110]. The resulting Hamiltonian is what we will study in this 
section. It now possesses two coupling constants which can compete against each 
other to lower the energy 


N N 
A =-KXY S, +AX Si Sig (7.144) 
i=l 


i=l 
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and we consider the chain with periodic boundary conditions and consider A > 0 
so that we are in the anti-ferromagnetic regime, which is the more interesting 
regime. 

This model is the generalization to a spin chain of the two-spin model that we 
studied in the previous section. Here each nearest-neighbour pair corresponds to 
the two-spin system that we have just studied. Each spin has magnitude KA =s 
and we will consider the large s limit. The two limiting cases are weak anisotropy 
A >> K and weak exchange coupling \< K, where A is the Heisenberg exchange 
interaction coupling constant and K is the anisotropy coupling constant. The 
limit of weak anisotropy was studied in a celebrated paper by Haldane [59] in a 
closely related model, hence we call our model a Haldane-like spin chain. Haldane 
demonstrated that in the large spin limit, s >> 1, the system can be mapped to a 
non-linear sigma model in field theory with distinguishing effects between integer 
and half-odd integer spins. The full rotational symmetry is broken explicitly into 
rotational symmetry about the z-axis with the total z-component Si z = J; 9j,z 
conserved. The Hamiltonian also possesses a discrete reflection symmetry about 
the z-axis S;,,— —S;,,. We will also study the model in the large spin limit, but 
we will take the limit of strong anisotropy, K > A, the opposite limit to Haldane. 

With \=0, the ground state is 2‘-fold degenerate, corresponding to each spin 


in the state S, = +s. Then s? = s? and the energy is —Ks?2N, which is minimal. 
For an even number of sites, the model is called bi-partite and the two fully anti- 
aligned Neél states are good starting points for investigating the ground state. 
For an odd number of sites, the Neél states are frustrated; they must contain 
at least one defect, which are called domain wall solitons [115, 39, 93, 20, 95]. 
There is a high level of degeneracy as the soliton can be placed anywhere along 
the cyclic, periodic chain and this degenerate system is the starting point for 
investigating the ground state for the case of an odd number of sites. Frustrated 
systems are of great importance in condensed matter physics as they lead to 
exotic phases of matter such as spin liquid [9], spin glasses [15] and topological 
orders [73]. Solitons will also occur on the periodic chain with even number of 
sites, but they must occur in soliton—anti-soliton pairs. 

Many physical magnetic systems such as CsNiF3 and Cott have been 
modelled with Hamiltonians of the form of Equation (7.144). Models of this 
form have been of research interest over the years since the work of Haldane [59]. 
To mention but a few, quite recently, the ground-state phase diagrams of the 
spin-2 X XZ anisotropic Heisenberg chain has been carefully investigated by the 
infinite system density-matrix-renormalization group (iDMRG) algorithm [74] 
and other numerical methods [58]. For the spin-1 X XZ anisotropic Heisenberg 
chain, the numerical exact diagonalization has been extensively investigated 
for finite size systems [25]. For an arbitrary spin, the phase diagrams and 
correlation exponents of an X XZ anisotropic Heisenberg chain has also been 
studied by representing the spins as a product of 2s spin $ operators [108]. This 
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research has been focussed on ground-state phase diagrams and the existence of 
Haldane phase (conjecture). In this section, we will study the spin chain with 
Hamiltonian given by the simple form given in Equation (7.144) with periodic 
boundary condition g N+1= Si; and we consider K > A > 0, i.e. strong easy-axis 
anisotropy and perturbative Heisenberg anti-ferromagnetic coupling. In the first 
subsection, we will study macroscopic quantum tunnelling of the Hamiltonian- 
defined Equation (7.144) for the case of an even spin chain. This analysis is 
based on spin-coherent state path integral formalism, which is appropriate for 
large spin systems. In the second subsection, which we include for completeness, 
we will deal with the case of an odd spin chain. Here, spin-coherent state path 
integral formalism fails to give a definitive result. Thus, our analysis is based on 
perturbation theory. 


7.8.1 Even Number of Sites and Spin-Coherent State Path Integral 


Let us consider our model, Equation (7.144), for N even. The ground state of 
the free theory (K term) is 2 -fold degenerate corresponding to each spin in the 
highest (lowest) weight states m = +s. In the degenerate subspace, there are two 
fully aligned states |t,7,1,7,---,7,4) and |J,J,J,1,---,J,1) and two fully anti- 


aligned Neél states |p) m= Mostky et Pape and |—p) = ss ea Vee stat where 
the arrow denotes the highest (lowest) weight states, i.e. m = s =¢(m = —s =|) 


for each individual spin and the remaining degenerate states are produced by 
flipping individual spins relative to these extremal states. These two Neél states 


+p) have the lowest energy at first-order in perturbation theory; however, they 
are not exact eigenstates of the quantum Hamiltonian in Equation (7.144), 
thus we expect ground-state quantum tunnelling coherence between them. Such 
tunnelling is usually mediated by an instanton trajectory, and the exponential of 
the instanton action (multiplied by a prefactor) yields the energy splitting. We 
will obtain this instanton trajectory via the spin-coherent state path integral 
formalism [5, 76, 75, 99], which is the appropriate formalism for large spin 
systems. In this formalism, the spin operators become unit vectors parameterized 
by spherical coordinates. The corresponding Euclidean Lagrangian in this 
formalism is given by 


N N 
Lpg =is X ġ;i(l — cos0;) +KX sin’ð; 


i=l i=l 
N 

+. 5 (sin 6; sin 6;41 cos(¢; — Pi+1) + cos; cos i41), (7.145) 
i=l 


where the periodicity condition i= N + 1 = 1 is imposed. The first term is the 
usual WZ [120] term which arises from the non-orthogonality of spin-coherent 
states while the other two terms correspond to the anisotropy energy and the 
Heisenberg exchange energy. Quantum amplitudes are obtained via the path 
integral and the saddle point approximation. Solutions of the Euclidean classical 
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equations of motion give information about quantum tunnelling amplitudes. The 
Euclidean classical equation of motion for ¢; is 
1—cos0; 
jp toes) = sin ĝ;—1 sin ĝ; sin(dj-1 oa Qi) 
dt 
—sin 6; sin O41 sin(d; = oi41) (7.146) 
while the equation of motion for 6; is 
isd; sin; + 2K sin 8; cos 6; 
A (cos 6; (sin 6:4 cos(@; — $:41)) + sin8;_1 cos(¢;_1 — ¢;)) 
=À (sin 6; (cos O41 + cos6;-1)) =0. (7.147) 


Summing both sides of os (7.146) gives 
a -o =0> D =l= (7.148) 


which corresponds to the conservation of the z-component of the total spin 
X; 57, as the full Hamiltonian, Equation (7.144), is invariant under rotations 
about the z-axis. 

We will solve these equations using simplifying, physically motivated ansatze. 
A particular solution of Equation (7.148) is 02,1 = 0, and 62, = T — 9, 
k =1,2---,N/2. Making the further simplifying ansatz œ; — ¢j41 = (—1)'t!d 
effectively reducing the system to a single spin problem, we get the effective 
Lagrangian (adding an irrelevant constant) 


N/2 
Lif! =s du inom (dor—1 — bar) 
= k=1 
5 [ean + cos(¢j = ra) sin? 0 (7.149) 
i=1 
=isNé — Z deosd+ Vers, (7.150) 


where Ue = N[K +A(1+cos¢@)]sin? 0. The spin chain problem has reduced to 
essentially the same problem we studied in the previous section with just two 
spins. The instanton that we will find must go from 0 =0 to 0 = m. Conservation 
of energy implies 0 Uesf =0, which then must vanish, Ueff =0, since it is so at 
#=0. This implies 


cos @ = — (+1) <-1 (7.151) 


since sinĝ (T) £0 along the whole trajectory. Thus ¢ is a complex constant which 
can be written as ¢ = m + id; identical to that of the two-spin case [99]. The 
classical equation of motion for @ gives 


is = —2\sin sin d = i2AsinO@ sinh ¢, (7.152) 
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which integrates as 
6(r) =2arctan (em) , (7.153) 


where w = (2A/s)sinh¢;. The instanton is independent of the number of spins 
and only depends on the initial and the final points. As found in [99], the 
instanton contributes to the action only through the WZ term, as Uers = 0 
all along the trajectory. The action is given by [99] 


is N T+iby isN 0 
Se = So — E dd cos 6|9—9 — 2o ddcos6|9=n 
2 Jo ae pee 
=0-isNxr+Ns¢; =—isNr+Ns¢y. (7.154) 


The two Neél states reorganize into the symmetric and anti-symmetric linear 
superpositions, |+) and the |—) as in [99]. The energy splitting is then 


Ns 
AE =2De~* =2D (x) cos(s Nm) (7.155) 


where D is a determinantal pre-factor which contains no À dependence. The 


Ns N 
» z 


degenerate perturbation theory in the interaction term. The energy splitting, 


factor of signifies that this energy splitting arises from 2s ( ) order in 
Equation (7.155), is the general formula for any even spin chain N. For N =2, 
we recover the results obtained previously [29, 30, 71, 72, 99]. The factor sN can 
be even or odd, depending on the value of the spin. For half-odd integer spin 
214+ 1)/2 and for N = 2(2k +1), the argument of the cosine in Equation (7.155) 
is sNa = (2l + 1)(2k+1)z and hence we find AEF is negative, which means that 
—) is the ground state and |+) is the first excited state. In all other cases, for any 
value of the spin s and N = 2(2k) the argument of the cosine is s Na = (2s)(2k)z, 
which is an even integer multiple of m and hence we find AF is positive and then 
+) is the ground state, |—) is the first excited state. 


7.8.2 Odd Spin Chain, Frustration and Solitons 


We include the analysis of the spin chain with an odd number of sites for 
the sake of completeness. This system can, in principle, be analysed using the 
spin-coherent state path integral. However, the tunnelling transitions are quite 
different, and no explicit, analytic expressions for the instantons that are required 
are known. In this situation, we revert back to the calculation using perturbative 
methods, which is actually quite interesting. 

When we consider a periodic chain with an odd number of sites, a soliton- 
like defect arises due to the spin frustration. The fully anti-aligned Neél-like 
state cannot complete periodically, as it requires an even total number of spins. 
Thus there has to be at least one pair of spins that is aligned. This can come 
in the form up—up or down—-down while all other pairs of neighbouring spins 
are in the up-down or down-up combination. As the total z-component of the 
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spin is conserved, these states lie in orthogonal super-selection sectors and never 
transform into each other. The position of the soliton is arbitrary thus each 
sector is N-fold degenerate. In the first case the total z-component of the spin is 
s while in the second case it is —s. We will, without loss of generality, consider 
the s sector. These degenerate states are denoted by |k), k=1,---,N, where 
KEI Lette, ety) (7.156) 


tT, 
soa 
k,k+1th place 


in obvious notation. These states are not exact eigenstates of the quantum spin 
Hamiltonian in Equation (7.144), thus we also expect ground-state quantum 
tunnelling amongst these states, just as in the case of a particle in a periodic 
potential, which would lift the degeneracy and reorganize the soliton states into a 
band. The explicit form of the required instanton, which from the spin-coherent 
state path integral would give rise to the appropriate tunnelling, is not known. 
However, from Equation (7.155) we can see for the case of even spins that energy 
splitting actually arises at the 2s (X) order in (degenerate) perturbation theory. 
The path integral and instanton method only gives the result which must also 
be available at this high order in degenerate perturbation theory. This indicates 
that the appropriate formalism for the odd quantum spin chain would simply be 
(degenerate) perturbation theory at high order. 
It is convenient to write the Hamiltonian as 


Î = +V (7.157) 


where Ĥo represents the K (free) term and V represents the \ (perturbative) 
term. The states in Equation (7.156) all have the same energy Es = — K Ns? from 
Hp and in first-order degenerate perturbation theory Es =—K Ns?—)(N—1)s?4 

As? = (—K —\)Ns?+2As? and are split from the first excited level, which requires 
the introduction of a soliton/anti-soliton pair, by an energy of 4A. As we take 


the limit K >> A, we assume that the action of lowering or raising the value of ô, 
incurs an energy cost proportion to K which is much more energy than creating 
a soliton/anti-soliton pair, which has an energy cost proportional to A. Although 
the soliton/anti-soliton states are the next states in the spectrum, they cannot be 
attained perturbatively, except at order 2s in perturbation theory. In each order 
of perturbation theory less than 2s, the degenerate multiplet of states mixes 
with the states of much higher energy, but due to invariance under translation, 
the corrections brought to each state are identical and their degeneracy cannot 
be split. However, at order 2s, the degenerate multiplet is mapped to itself. 
Although the state of an additional soliton/anti-soliton pair is also reached at 
this order, since it is not degenerate in energy with the original multiplet of N 
states, its correction will be perturbatively small. 

Reaching the degenerate multiplet at order 2s causes the multiplet to split in 
energy and the states to reorganize into a band. Indeed, V2 contains the term 
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(Shad Siro 


close to the position of the soliton, which when acting on the ket |k}, flips the 


)?s and (Sj_,S;, )?s. These operators represent quantum fluctuations 


anti-aligned pair of spins at positions k+1,k+2 and at k—1,k, respectively. It is 
easy to see that flipping this pair of spins has the effect of translating the soliton 
|k) + |k+2) and |k) > |k — 2), respectively. All other terms in V?° are quantum 
fluctuations away from the position of the soliton. They map to states out of the 
degenerate subspace, either inserting a soliton/anti-soliton pair or changing the 
value of S* to non-extremal values, and hence do not contribute to breaking 
the degeneracy. 

To compute the splitting and the corresponding eigenstates, we follow [30]. 
We have to diagonalize the N x N matrix with components bp,» given by 


buv = (ÝA), pv =1,2 N (7.158) 


aN 2 


s—1 
where A?s-1 = a 2 V , and Q = 1 — `` |u) (u|. These matrices are a 


generalization of the 2 x 2 matrix in [30]. The calculation of the components is 


straightforward, for example, looking at b„,ı we find 


À 2s a Q ae 2s—1 
ba =(3) (al S2 S3 (5 S3 |1) 


s~ 440 
À 2s Q 2s—1 
+ (= Shop (2 sts;) 1). 7.159 
(3) sts (Egs) D aso) 
Applying the operators 2s times on the right-hand side we obtain 


bur =C[(u]3) + (uN — 1)], (7.160) 


where C is given by 
À 2s 2s 2s—1 1 
=+ļ|-—- 2s — 1 —— 
e) M eE 


=k (4) cn i a ey (7.161) 


The first product in Equation (7.161) comes from the two square roots that 


accompany the action of the raising and lowering operators, and the second 
product is a consequence of the energy denominators. The plus or minus sign 
arises because we have 2s — 1 products of negative energy denominators in 
Equation (7.159), so if s is integer, 2s — 1 is odd and we get a minus sign, while 
for half-odd integer s, 2s — 1 is even and we get a plus sign. Similarly, we can 
show that b, =C[(u|v +2) + (uly — 2)], defined periodically of course. Thus we 
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find that the matrix, [b,n], that we must diagonalize is a circulant matrix [37] 


0 0 1 0 1 0 
0 0 0 1 0 1 
1 0 0 0 1 0 
ae 162 
Puwl=Cl: 4 4 (7.162) 
1 0 0 0 
0 1 1 0 0 0 


In this matrix each row element is moved one step to the right, periodically, 
relative to the preceding row. The eigenvalues and eigenvectors are well known. 
The jth eigenvalue is given by 
— 2 N-1 
Ej =b11 + 61,205 + b1,305 +++ +b, nw; ; (7.163) 
277 

where wj = eiT is the jt , N* root of unity with corresponding eigen- 
vector |272) = (L wj wz TAEDE for j = 0,1,2,---,N—1. For our matrix, 
Equation (7.162), the only non-zero coefficients are bı 3 and bı, y—1, thus the 
one-soliton energy bands are 


Ej = Clo +w?) = Clos +w;”) 
Arj 
=2C — |. 7.164 
cos (2) (7.164) 
Introducing the Brillouin zone momentum q = ja/N, the energy bands 
Equation (7.164) can be written as 


Eq = 2C cos (4q) (7.165) 


which is gapless but is doubly degenerate as the cosine passes through two periods 
in the Brillouin zone. The exact spectrum is symmetric about the value N/2. 
With [x] the greatest integer not greater than x, the states for j = [N/2] — k and 
j=([N/2]+k+1 for k=0,1,2,--- ,[N/2]—1 are degenerate as cos (mega) = 


cos (sgan) since [N/2] = N/2— 1/2. However, the state with k = [N/2] 
is not paired, only j =0 is allowed. When s is an integer, C is negative and the 
unpaired state j = 0 is the ground state which is then non-degenerate, but for s a 
half-odd integer, C is positive, and the ground states are the degenerate pair with 
j =([N/2],[N/2] + 1 in accordance with Kramers’ theorem [78]. However, in the 
thermodynamic limit, N — oo, the spectrum simply becomes doubly degenerate 
for all values of the spin and gapless. 


8 


Quantum Electrodynamics in 1+1 Dimensions 


8.1 The Abelian Higgs Model 


Instantons imply drastic changes in the spectrum of theories with essentially 
Abelian gauge invariance in 1+ 1 and 2+ 1 dimensions. We say essentially 
Abelian, since we include in this class theories which are spontaneously broken 
to a residual U(1) invariance. In 1+1 dimensions we consider the theory defined 
by the Lagrangian density [61], 


* A * * v 
L= (D, (D"A -FA — Gto- Fa eE, (8.1) 
where 
Do =O,O+tApd 
Fy = ô Ay — p Ap. (8.2) 


We take D, =ô p +ieA Q, but we have replaced A, > lAn. The Lagrangian is 
invariant under a local gauge transformation which has a natural multiplication 
law corresponding to the group U (1) 


p> Aedh = gx, tjo glx, t) €U(1) 
Ay > EACD (A, — i8) eÀ = A, — 0,,A. (8.3) 


Then 
Dud 0, (EeP) + (Ay — ON (at)) OG 
= iAd b+ 910, Ala toH AOA, o e'4@ 10, A(z, to 
= FAG) Dd. (8.4) 


We impose that limj,),..g(#,t) = 1. This gives an effective topological 
compactification of the space since the gauge transformation at spatial infinity 
must be the same in all directions. 
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Figure 8.1. The symmetric breaking potential U(|¢]) 


There are two cases to consider depending on the sign of u?. For p? > 0 the 
quadratic part of the Lagrangian is 


2 
* x 1 3 
L= (Lh) (OG) — To o- gatet" (8.5) 
with corresponding equations of motion 
0,0" b+ po =0 
ð FRY =0, (8.6) 


which describe a free, massive, scalar particle and a massless vector field, the free, 
electromagnetic field. The conserved current corresponding to gauge invariance is 


“= 6*0" G — (ONO") Q. (8.7) 


External charges that are well separated experience the usual Coulomb force. 
This is true in any dimension except in 1+ 1 dimensions, the case that we are 
considering first. Here, the Coulomb force is independent of the separation and 
it costs an infinite amount of energy to separate two charges to infinity. We 
say that charges are confined. Furthermore, there is no photon. There is no 
transverse direction for the polarization states of the photon. The spectrum 
consists of bound states of particle-anti-particle pairs, which are stable. They 
cannot disintegrate since there is no photon. 

For the other case with u? < 0, the potential (as depicted in Figure 8.1) is of 
the symmetry breaking type 


uleh = igli - gta 8.8 
loD = flo- E +0, (8.8) 
where the C is adjusted so that the potential vanishes at the minimum. The 
minimum is at |¢|? = br We fix the gauge so that Im(¢) = 0, and we write 
aie (8.9) 


vx 
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with 7 € R. Then we get the Lagrangian density 


£=(3, ia (H | n) (Ə! +iA") (Aen) 


JEL y “2 (ll) oe pow (8.10) 
AYR BAYA) gee : 


which yields the quadratic part 


2 
H peaca 1 
Lo = pno” N + 7 A, A" + pn Iz 


This now corresponds to a scalar particle with a mass of Jz and a vector particle 


Fp F”. (8.11) 


of mass Re: Then the expectation is that the potential between particles should 
drop off exponentially with the usual Yukawa factor 

eM (8.12) 
with M= 4 or M = He. We will find, surprisingly, that this is again not true in 
1+1 dimensions. Instantons change the force between the particles and actually 
imply confinement. The only difference between the cases u? > 0 and u? <0 is 
that the force is exponentially smaller (in A) for the case u? < 0; however, it is 
still independent of separation. 


8.2 The Euclidean Theory and Finite Action 


To see this result, we must analyse the Euclidean theory. Here the Lagrangian 


density is 
1 j 5 Nees 2 
L= TPF" +(D,¢) (DY) +7 ($ p-a?) (8.13) 
adding a constant, where a = d, These are three positive terms. For a 


configuration of finite Euclidean action, each term must give a finite contribution 
when integrated over R°. This implies that ¢*¢ > a?, D, —> 0 and Fy + 0 
faster than L, 


$> a? => lim ġ=g(0)a 


z: 1 
Fy, > 0> iA, G0, (9) ‘+0(4) = ið A 
A 


for g=e' 
Dye > 0 = Oug(P)at G2, (F) g(A)a 
= (—9(0)8, g (0) +50, (3) *) g(@)a=0. (8.14) 


This is satisfied if g(?) = g. Thus the configurations with finite Euclidean 
action are characterized by g(@). g(@) defines a mapping of the circle at infinity 
parametrized by @ into the group U(1), which is just the unit circle as in 
Figure 8.2. 
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Figure 8.2. Mapping the spatial circle at oo to the circle in U(1) 


8.2.1 Topological Homotopy Classes 


The space of such maps separates into homotopically inequivalent classes. These 
classes are characterized by the winding number of the map. A map from one 
class cannot be continuously deformed to any other map from another class. It is 
intuitively obvious that there are an infinite number of classes each corresponding 
to a winding number. We can take 


O= — n=v. (8.15) 


v= — aog(0) <9" (0). (8.16) 


20 
v= =| gO) (iv) g! (0) = —2n =v. (8.17) 
0 27 
If we make an arbitrary, infinitesimal change in g(6), 
g(0) => eA g (A) = g() +icA(0)g(0) (8.18) 


with A(@) of compact support in [0,27), 


) 
re d d 4 
TOOT (0) ARa Oe aa a 
= iA(O)g(0) g0) ~1N(0)9(0)-91(0) -iZ (0) 
d 
= igh). (8.19) 
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Thus 
w= f aw CETO) =E f 0A) 
Š ~ (A(2m) — A(0)) =0. (8.20) 


Thus for each class, v is an invariant under arbitrary continuous deformation. 
Furthermore, if g(0) = gv, (@)gv.(@), then 


v= 2 : EAO Of TOPAT 
Saf, Pon O (OBO) Oa O gO 
0 1 v2 do?"2 vi vi 79" 
=n, + n2. n 


Finally, using iA, = gô, g! +0 (5) 


27 . 2m 
d i i . 1 
d0g() gI (@) = on Jy drif pEuv Av = a dty,Ap 
o 1 2 al 2 _ ® 
= oe fe LOnE py Av = ee fe LE py Pw =— (+) 5 (8.22) 


giving that the flux is quantized in units of 27. In each homotopy class, the 


y= = 
20 0 


T=CO 


configuration of minimum action must be stationary and hence satisfy the 
Euclidean equations of motion. Because the solutions with different v cannot 
be obtained from each other by continuous deformation, there should be an 
infinite action barrier between each class. 


8.2.2 Nielsen—Olesen Vortices 
The solutions for each v are known to exist and are called the Nielsen—Olesen 
vortices [96]. They are described by two radial functions, for v = 1 


atr) 


olr) =e f(r). (8.23) 


This form implies the equations 


Ay =€yoTv 


-Hi (ro Gm) GSO) oraes e2 


rdr r2 Qn 
i Ld (1480) P) (2) 
e? dr G dr 2r )+ r ( 27 1) mi 48:28) 


A solution exists, as depicted in Figure 8.3, with the behaviour for the magnetic 
field B(r) 


ee ot) 


B= 1 d Or) 


~ dnr dr 2r 


148 Quantum Electrodynamics in 1+1 Dimensions 


Figure 8.3. The form of the function f(r) and the magnetic field B(r) 


f(r) 74 
B(r) +, (8.26) 


where ®(r) has the interpretation of being equal to the magnetic flux inside the 
radius r while ® is the total magnetic flux in the soliton, which is quantized in 
units of 27. The magnetic field is concentrated around the origin and both fields 
approach the vacuum configuration exponentially fast with a non-trivial winding 
number. 

This solution mediates tunnelling between inequivalent classical vacua, which 
correspond to classical configurations with zero energy. The energy is given by 
(for Ap = 0) 


2 


= I dir (041)? + (800)" (3P) + (Did)" (Did) +A (9*9 —a?)”. (8.27) 


The simplest zero-energy configuration is ¢= a, A, = 0. There exists, however, 
the freedom to transform this solution by a local gauge transformation that 
depends only on space, which keeps the gauge condition Ap = 0 invariant, 


o> g(a)a A; > —ig(x)0,g~ +(x). (8.28) 


We impose the additional condition, the lim, g(x) > 1, which is consistent 
with our desire to consider a theory with arbitrary local excitations but 
asymptotically no excitations. Then we get the effective compactification of the 
spatial hypersurface. Topologically it is now just a circle and g(x) again maps 
the circle that is space onto the circle in U(1). These maps are characterized by 
winding numbers. Thus the classical vacua 


b=g(x)a d 
A = igu (2) zI (2) (8.29) 


indexed by v € Z are homotopically inequivalent. We cannot deform one into 
another while staying at € = 0. However, the energy barrier between them is 
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Figure 8.4. The form of the gauge transformation A(0) 


not infinite. The Nielsen—Olesen vortices interpolate between these vacua. To 
see this we must transform the Nielsen—Olesen vortex into a form suitable for 
the description in terms of many vacua, i.e. to the gauge Ap = 0. 

We first perform a gauge transformation g(@) = e~““), which has the limit at 
spacetime infinity given by 


6 for 6 € (0,7/4) 
7/40 for 0 € (1/4,7/4+€) 
A(@)=40 for 0 € (n/4+¢€,3m/4—€) (8.30) 
0 —5r/4 for 0 € (87/4 —€,37/4) 
(0 — 27) for 0 € (37/4, 27) 


as drawn in Figure 8.4. The corresponding g(@) is topologically trivial; we can 
simply deform the two saw-tooth humps to zero (non-trivial winding number 
requires a A(@) discontinuous by 2nmr between its value at 0 = 0 and 0 = 
27). Therefore, the gauge transformation can be continued everywhere inside 
the spacetime and define a gauge transformation at all points. This gauge 
transformation (note this is an inverse transformation, A + —A) takes 


Ay > Ay = Ap + O,A(8) (8.31) 


and it is easy to see that this vanishes asymptotically, except where A(0) =0, i.e. 
for 0 € (7/4+€,30/4—€). Thus A, > 0 for t € [-00,T], T finite, exponentially 
fast as |x| — oo, as depicted in Figure 8.5. 


Now we further perform the gauge transformation to put Ao = 0 everywhere; 
this is easily implemented by the choice 


A(x,t) -f dt! Ão(x,t') 


—o0oO 
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Figure 8.5. The regions of spacetime where the gauge field vanishes 


Ay(2,t) > Ay(z,t) = A, (x,t) — O,A(z,t) = A, (x,t) — ô, J at! Ao (x,t). 
(8.32) 


This makes Ap = 0 everywhere, but maintains A, > 0 for t € [—o0,T], T finite, 
exponentially fast as |z| — co, since both A; — 0 and Ap > 0 exponentially fast 
at spatial infinity in the region t € [—00,T], due to the first gauge transformation. 


Thus as t > —oo, A, = 0, but as t > +00 we have, 
A > g(z)ðsg (2) (8.33) 


with i 
g(x) E e Tzw Ww Aot) 9g) ¢i9 (8.34) 
where g(0) = e~*4 is our first gauge transformation and eb Zo at’ Aolat) ig 
the second gauge transformation that put Ag = 0. The final factor e’” is the 
asymptotic gauge transformation of the Nielsen—Olesen vortex. The first two 
factors are topologically trivial gauge transformations: in each case the exponent 
can be continuously switched to zero, thus the winding number of the gauge 
transformation e’’, which is 1, is unchanged. However, the two trivial factors 
manage to bunch all of the non-trivial winding of e’? into the spatial line x at 
t=. 
Thus we have put the Nielsen—Olesen vortex in a gauge where it interpolates 
from the vacuum configuration g(x) = 1 at t = —oco to the non-trivially 


> OO 1A 1 P 
transformed vacuum configuration g(x) = e'/-%0 Aole-t )g(0)ei®, The situation 
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is exactly analogous to the problem of a periodic potential on a line. The 
classical vacua form a denumerable infinity of local minima indexed by the 
winding number n. There is a finite energy barrier between each one, and the 
Nielsen—Olesen vortex is the instanton that mediates the tunnelling between 
them. 


8.3 Tunnelling Transitions 


We can calculate the matrix element 


vios eur s 
(v=nle— ae * ly =0) =v f D (A1, ġ*, p) T (8.35) 
[din]=0 

in the semi-classical approximation. The functional integral is simply identified 
with the integral over all finite action field configurations with v = n. The 
continuation from Euclidean space automatically projects on the vacuum in this 
sector. The critical point of the action contains the vortex with v = n; however, 
this is not the most important configuration. The most important configurations 
correspond to n+ vortices with v = 1 and n_ vortices with v = —1, widely 
separated, such that n} — n- =n. The action for such a configuration is very 
close to (ny +n_)S”(v =1). The entropy factor, counting the degeneracy of the 

configuration, is n 
n ne 
es ee (8.36) 

n+!n_! 
In comparison, for a single vortex with v =n, the action is presumably smaller, 
but the entropy factor is just T.L, since there is only one object. Thus the dilute 
multi-instanton configurations are arbitrarily more important as TL — oo. Then 
1 M4 SOON =00 ntn- 
(v=nle w a |v =0) =Ndet, ? 5 5 RA mg (THe h fx) 
n4=0 n_=0 

X Ong -n,n (8.37) 


where K~? (so that it appears in the formula as just K) is given by the ratio of 
the determinant prime corresponding to the quadratic part of the Lagrangian in 
the presence of one vortex, divided by the determinant of the free quadratic part 
(written as deto), and the Jacobian factors from the usual change of variables 
that take into account zero modes. The prefactor is set equal to one by choosing 
the normalization M. Now using the formula 


a alee 
ao = — | dhed 8.38 
b= e (8.38) 
we get 
A 1 2T ; pg in4}0p—in—0 sE n4+n 
(v=nle R lv = 0) = — doe’” x eee ee bee ie 
2T Jo n4!n_! 


n+,n—=0 
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Figure 8.6. Creation of a pair of charges at the origin, separation by R, held 
for time T’ and then annihilated 


=| de® (v=nl6)(O\v =0). (8.39) 
0 


Thus we find the infinite set of classical vacua rearrange themselves to form a 
band of states parametrized by 0 with energy density 


0 sf 
An =—2Ke7 F cos (8.40) 
and the matrix element 
ind 
(v=n|6) == (8.41) 


8.4 The Wilson Loop 


This rearrangement of the vacua has important consequences for the force 
between charges. Consider the creation of an external charged particle and anti- 
particle pair. We create them at the origin, separate them by a large distance 
R, hold them at this separation for a long time T’, and then we let them come 
together and annihilate, as depicted in Figure 8.6. A particle of charge q, in an 
electromagnetic field, experiences an additional change to its wave function by 
the factor 


ete S trp An | (8.42) 
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Consider external charges governed by a dynamics with a Lagrangian 


L= La? qti Ai — qAo — V (a). (8.43) 
The equation of motion is 
#; +qÅ;(21) — 9450; Å; (21) + 90; Ao (21) + O:V (a1) = 0, (8.44) 
which can be rewritten 
Ëi — qtjEjikBk(xı) = —0;V (a1) + qEi(21), (8.45) 


where E(x) = ðA; (a1) +0;A (27) is the electric field and B; (xı) = €;ihO; Ag (21) 
is the magnetic field. Thus the action in the functional integral for the particle 
is augmented by the term 


0 aD > 
ete y eiT e~it S dt(t:Ãi(e1)-4A0(21)) 


gO. 
= eiT eta S de" Ay, (8.46) 
For an anti-particle the additional factor is, of course, 
ee f dz" An, (8.47) 


Thus for our trajectory, the additional factor becomes a closed integral in the 
exponent, 


eter del An. (8.48) 


We perform the functional integral over the gauge and scalar fields treating our 
particles as external, with their dynamics controlled by V (xı). However, the wave 
functions of the particles will change by this additional factor, which we must 
take into account. When we integrate over A,,,¢,¢* we obtain the matrix element 
of the operator (in Euclidean space) 


W =e eS tun, (8.49) 


This is called the Wilson loop operator. The matrix element of the operator 
behaves approximately as 
Wwe ET (è), (8.50) 


If E(R)~ CR for some constant C, the interaction between the charges is said to 
be confining, and the expectation value of the Wilson loop operator will behave 
like 

(W) ~ e704), (8.51) 


where A is the area of the loop. This is the celebrated criterion of area law 
behaviour of the Wilson loop for confining interactions. If, on the other hand, 
the E(R) ~ D for some constant D, we get 

(W) ~ e PPC), (8.52) 


where P is the perimeter of the loop. Such behaviour of the Wilson loop does 
not imply confining interactions. 
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Figure 8.7. A dilute gas of instantons and anti-instantons surround the Wilson 
loop 


8.4.1 Expectation Value of the Wilson Loop Operator 
Thus we wish to evaluate 
[D(A is d, ¢*) SF iv yy 
[Dd ( (Ay, @, o*) = sE eivd 


in the semi-classical approximation. For the numerator we divide the summation 


(0| W |0) = (8.53) 


over the positions of the vortices (instantons) and the anti-vortices (anti- 
instantons) into those inside the loop and those outside the loop, as depicted 
in Figure 8.7. We drop the contribution from vortices situated on or near the 
boundary; these form a negligible part of the set of all configurations, if the size 
of the loop is much larger than the size of the vortices. 
The integrand splits neatly into a part from outside and a part from inside 

the Wilson loop 

S = goutside , ginside 

y = poutside | „inside (8.54) 
however, 


inside 


W = erian S (8.55) 


Inside the volume available is RT’, while outside the volume available is LT — 
RT', for each vortex. We sum independently over the vortices and the anti- 
vortices, inside and outside the loop, with no constraint on their numbers. The 
contribution inside has 0 > 0 + 2a, thus we get 


sE 
a T ((LT—- RT’) cos0+RT" cos(0+ 3a) reosa)) 
(| W |0) 


_ Sy 1 2r 
2Ke h RT (- cos 6+cos(6+=24 = )) 
=e (8.56) 
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Then comparing with Equation (8.50) we get 


2 
E(R) =2R (cost — cos (0 i ma) ) Ker (8.57) 
and hence 
E(R)~R (8.58) 
implying confinement. 


We can also calculate 


1 2 
ZIT d x (0| euv Fu |0) 
3& 
4T oe An, 0") ve ae ee 
sE 
T iD And, p*) Je- E eiro 


en foi (A,,,4,¢°) - 5 ive 
OPT d0” En 
Ar d 


= opr ag (ere F LT cos) 


sE 


= —i4n Ke” T sin(6). (8.59) 


1 
(6| zmt uv I0) = 


For small 0 from Equation (8.40), removing a constant, we have 


6 së 
_ =K Te. (8.60) 


Also, 


sf 
(0| Fiz 10) = —i4rKe Tt 0 
2 ak _ se 
E(R)=2R 04 — | 0+ — Ke t. (8.61) 


This lends itself to the following interpretation. In the 0 vacuum, there exists 
an electric field that is proportional to 0 with a corresponding energy density 
proportional to 6?. The external charges induce an electric field between them, 
proportional to their charges. The energy changes by the separation of the 
charges multiplied by the energy density, which in this case is clearly 


(0+ 2ra) . (8.62) 


There exist non-linear effects that convert these to periodic functions in 4. This 
is because the theory contains particles of charge e. For q > e, a charged particle- 
anti-particle pair can be created, which then can migrate to the oppositely 
charged external charges, lowering their charge and hence the induced electric 


field. Thus q’s are equivalent modulo e. 
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Our analysis, although encouraging, cannot work in higher dimensions. In 1+1 
dimensions, the flux of each instanton inside the loop must totally pass through 
the loop, independent of its position inside the loop. In 3+ 1 dimensions the 
instantons are not like flux tubes, they are O(4)-symmetric objects. Instead of 
the Wilson loop, we would require some analogous “Wilson three-dimensional 
hypersurface” to reach the same conclusion. Confinement must, however, imply 
the area law for the usual Wilson loop, in any dimensions. Thus we do not 
expect instantons to be responsible for confinement in higher dimensions. We 
can, as we shall see in Chapter 9, circumvent this problem in 2+ 1 dimensions 
by introducing a mild non-Abelian nature. 


9 
The Polyakov Proof of Confinement 


In a totally surprising result, Polyakov demonstrated that instantons could 
provide the key to confinement in a particular model in 2+1 dimensions [103]. In 
this chapter, we will study in detail the Polyakov proof of confinement. We will 
see that it requires a mild non-Abelian aspect to the theory, but the confinement 
occurs essentially because of the existence of magnetic monopole solitons in the 
theory. Purely Abelian gauge theory also contains magnetic monopoles, but they 
are singular configurations of infinite energy, and hence of no import. The minor 
non-Abelian excursion simply allows for the existence of finite action (or energy) 


magnetic monopoles. 


9.1 Georgi—Glashow model 


We continue our study of quantum electrodynamics in 2+ 1 dimensions; however, 
now we shall consider a theory that is Abelian at low energy but non-Abelian at 
high energy. This occurs due to spontaneous symmetry breaking. We consider a 
non-Abelian gauge theory with gauge group O(3) ~ SU(2) spontaneously broken 
to U(1). The model is the 2+ 1-dimensional version of the Georgi-Glashow model 
[54]. The fields correspond to an iso-triplet of scalar fields interacting via non- 
Abelian gauge fields and self-interactions, the Lagrangian density is given by 


1 1 2 
aF +|D.P -AC 0") , (9.1) 


eT pv 


where 


a _ a a abc 4b ac 
F2, = 0,A¢— 0,A% + €%° Ab AS 


=| P|, (Duo = Oud" +e?" ADO 


lo? = 69". (9.2) 
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The theory is invariant under local redefinition of the fields by 


pe > R(x’) g? 
Al RY AL + e R a" 0, R (2), (9.3) 


where R2°(x”) is a smooth, O(3)-valued gauge transformation. 

We may sometimes wish to use the matrix notation, hence we record the 
corresponding formulae here. The Higgs field is written as ¢, which is a three- 
real entry column. The gauge field is a 3 x 3 real, anti-symmetric matrix A, 
for each spacetime index u. There are exactly three independent anti-symmetric 
3 x 3 matrices where a basis can be denoted as T* with components numerically 


abc ( 


given by Ty. =e here the placement of the index as upper or lower is of no 


import). Then A, = Aj,T*. Then the gauge transformation is written as 


b> R(x") 
An R(x”) Ay + R(2”)ð RT (2”), (9.4) 


where R(x”) is a 3 x 3 orthogonal matrix (hence its inverse is given by its 
transpose). 

We can easily see the perturbative, physical particle spectrum of the theory 
by making a choice of gauge 

¢ =¢° =0. (9.5) 

To be honest, this is an incomplete gauge-fixing condition: it does not fix the 
gauge degree of freedom if ¢ is already in the three-direction and it does not fix 
the gauge transformations which leave ¢3 invariant. However, it is sufficient for 
us to extract the particle spectrum. Then, replacing ¢? = a +n we have: 


(Dud) = Oud" +e ALG" = ON ALO? = Aj (a+) 
(Dud) = Oud? + PALS = CALS? = —Al (a +n) 


(Dud)? = 0,0? + 6° A® G° = Opla +n) = Oyun. (9.6) 
Hence 
|D ao? = O,nd"n + (ALAM + AZA”) (a? +2an +7), (9.7) 
giving the Lagrangian density 
1 
L=- Tent + 0,70" + pno 


1 
+ (ALA + A? A?) (a? + 2an +77) 7^ (2an4 We)’. (9.8) 
This yields the quadratic part 
= a a av 
L= zZ (3, A5 — 3, A") (3t A”) + 3u” + (A4 A™ + A AP") a? — àan?. (9.9) 


The physical particle spectrum can now be read off from this equation; it 


corresponds to a massless vector field A}, two massive vector fields A}, and AŽ 
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of mass M? = 4e?a?, and a neutral massive scalar field 7 (neutral with respect 
to the gauge field A3) with mass m? = Xa. 7 is neutral since it does not couple 
to Ał, while the massive vector fields A} and A? are charged as they do. The 
two fields ¢, and @¢2 are, of course, absent. We might say this is due to our 
gauge choice; however, the fact that the corresponding physical excitations do 
not exist is independent of the gauge choice. What we are describing is the classic 
Higgs mechanism [61], where the putative massless Goldstone bosons associated 
with spontaneous symmetry-breaking are swallowed by the gauge bosons that 
correspond to the broken symmetry directions. These gauge bosons consequently 
become massive. Hence the Goldstone bosons are absent, but their degrees of 
freedom show up in the additional degrees of freedom of the massive vector 
bosons (as opposed to massless ones). 

We will see in this chapter that, as in the case of the Abelian Higgs model 
in 1+1 dimensions in Chapter 8, the actual spectrum of the theory does not 
correspond to this naive spectrum. We will find that the theory in fact confines 
charged states due to the effects of instantons and that there are no massless 
states, especially there is no massless photon. The validity of the argument that 
the Wilson loop is able to subtend an appreciable amount of flux from the 
instantons, which was used in Chapter 8, becomes critical in 2+ 1 dimensions. 
As the size of the Wilson loop becomes large, it can subtend an arbitrary amount 
of flux from nearby instantons, and hence the effect of instantons is significant. 
In 3+1 dimensions we will see that the argument fails. 


9.2 Euclidean Theory 


Analytically continuing our action to three-dimensional Euclidean space 
(although much of what we say is trivially generalized to d Euclidean dimensions) 
gives 


Spx fbx (ZohtFn + 5(Dd)"(Dyo)" + a (9.10) 


which is again composed of three positive semi-definite terms. We look for 
finite action configurations: these would correspond to instantons and should 
be relevant for tunnelling. Finite action requires that the fields behave in such 
a way that each term in the action goes to zero sufficiently fast at infinity, as 
each term is positive semi-definite. Sufficiently fast can include ~ 1/r fall off of 
particular fields or their derivatives, the only condition is that the Euclidean 
action be finite, and hence each term vanishes sufficiently fast. This then implies 
that at infinity 


G+ RP(Q)O5 popi = 0? (9.11) 
(Do) 0 (9.12) 
F*, +0, (9.13) 
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where Q are the angular coordinates parametrizing the sphere at infinity. 
Equation (9.13) requires that the gauge fields approach a configuration that 
corresponds to a pure gauge transformation of the vacuum, sufficiently fast. We 
can write the gauge field in a matrix notation 


A= Aer, (9.14) 


where T° are 3 x 3 matrices with components numerically given by T£, = €7°°. 


Then Equation (9.13) implies, in this matrix notation, that the gauge field 
corresponds to a gauge transformation of zero, 


Ayu Ra, (9)3 Ri (Q). (9.15) 


Then automatically for the covariant derivative of the scalar field we get 
(suppressing the Q dependence and its index a) 


Dy > (ð, + Ra, AR, Redo 
ah, (RLa,Re +RERa, (3L Rİ,,)Re) öö 
= Ro (RÌ Ra,.9,(Rh,, Ro)) 0 =0. (9.16) 


This requires that RRA, On (Ri, Ro), which is a Lie algebra element, be in the 
direction that annihilates ¢° or correspondingly Ri, Rg leaves ¢° invariant, that 


is Bi Ry =H where H¢° = ¢°. H may not be globally defined on the sphere at 
infinity; however, locally it is, and that is all we need. This defines the invariant 
subgroup or stabilizer of ¢9. But now we may redefine Ry —> Rg = RsH~! as 
Rọ is only defined up to an element of the stabilizer of o, as is obvious from 
Equation (9.11) (we will drop the tilde from now on). Thus we get Ri Rg =1 
at least locally on the sphere at infinity. Although we started with different, 
independent gauge transformations, Ry and R,,,, in Equations (9.11) and (9.15), 
respectively, we see that Equation (9.12) forces the gauge transformations to be 
the same. We will now call this gauge transformation R(Q). We underline that 
R(Q) may not be globally defined, and may actually be singular at some place 
on the sphere at infinity. In fact, for a non-trivial mapping it must be singular 
somewhere. However, its action on ġo, which defines the values of the Higgs field 
at infinity, must be globally defined. 

The condition of finite action is actually a little more subtle. Indeed, the gauge 
field must become pure gauge only as fast as ~ 1/r for the F*, F“, to give a finite 


uvt uv 
contribution. Thus we should modify Equation (9.15) to 


A, > R(Q) Rİ (Q) + A,, (9.17) 


where A, ~ o(1/r) (keeping in mind that the pure gauge terms also behave as 
~ 1/r). However, such a modification could cause trouble in Equation (9.12), 
as the covariant derivative of the scalar field must vanish faster than 1/r? for 
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finite action. But this can again be solved if these additional possible terms in 
the gauge field are in the direction of the stabilizer of the Higgs field. Thus we 
can tolerate additional non-pure gauge terms in the gauge field as long as 


A, Roo = 0. (9.18) 


9.2.1 Topological Homotopy Classes 


Thus finite action configurations are characterized by R(Q) defined at |z| — oo. 
This defines a map of the sphere at infinity S¢~! (generalizing temporarily to d 
dimensions) into the space of “vacuum” configurations, {¢* : ¢°¢% =a?} = M = 
5S”, The equivalence classes under homotopy of these maps form the homotopy 


groups 
Ha-1ı (M). (9.19) 
There is a fascinating and complex set of corresponding homotopy groups [51]: 
0 d=2 
Z d=3 
Z d=4 
Zo d=5 
Zo d=6 
Zı2 d=7 
Zo d=8 
Zə d=9 
Ha- (M) = fi TN (9.20) 
Zı5 d=11 
Zo d=12 
Zə X Zo d=13 
Zı2 X Ze d=14 
Zs4 X Z2 X Za d=15 
Zə X Zo d=16 


Thus there exist topologically non-trivial configurations in each dimension and 
the possibility of non-trivial finite Euclidean action configurations. In d = 3, the 
corresponding instantons are actually the ’t Hooft-Polyakov magnetic monopole 
solitons of the 3+ 1-dimensional theory. 
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Figure 9.1. Mapping the whole S$? at oo to a point 


© 6 


Figure 9.2. Mapping the S? at oo on to the vacuum manifold S$? 


9.2.2 Magnetic Monopole Solutions 
For d= 3, we have the maps 
R(O)do: 8? > 9, (9.21) 


where the first S? is defined by the set of all Q’s, i.e. the sphere at oo, while the 
second S? is defined by the set of Higgs field values 6? = 67¢% = a?. These fall 
into homotopically inequivalent classes, characterized by the winding number of 
the map, much like the previous case of maps of St — S! in the Abelian Higgs 
model. Pictured in Figures 9.1 and 9.2 are the trivial map to a point and the 
onto map, where each point in the first S? is mapped to the analogous point on 
the second $?. We cannot continuously deform one configuration into another if 
they have different winding numbers, that is the definition of homotopy classes, 
and typically this implies that there exists an infinite action barrier between 
configurations in different classes. We will see that the topological winding 
number turns out to be associated with the magnetic charge of each sector. The 
minimum action configuration in each class must solve the equations of motion. 
The action must be stationary at the minimum action configuration since, if the 
first-order variation does not vanish, one can find a variation which lowers the 
action. The equations of motion are therefore satisfied. What is not necessary 
is that the minimum action configuration is non-trivial; it could, for example, 
collapse and shrink to a point or, conversely, spread out and dilute infinitely. We 
will show that it must be non-trivial. 

The homotopy class with topological winding number n = 1 defines the 
standard instanton. We can prove that the action is bounded from below in 
each sector using a method first shown by Bogomolny [17]. We assume that the 
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potential V(@) is positive semi-definite. Defining the non-Abelian magnetic field 
as BY = Zeij F sh we have 


Ce I dx GEZ, 5(Di0)"(D:0)" +V(0)) 


> [eo (Fewer) + Zi (Dig) 


>+ per da (Did)* 

=+ / Pe re 

= + J d’xô; (BEH) — (A,B) ¢" — B7 Atos) 

=+ ( f dS (B2 ¢*) — J da (3; B? + A? BF) 6) 

= +ga, (9.22) 


where in the second line we have simply completed the square and dropped 
the potential, in the third line we have dropped the positive semi-definite first 
term and in the penultimate equation the last term vanishes because of the 
Jacobi identity. The Jacobi identity is €;;k|D:, |D}, Dk]] = 0 which is simply, 
trivially, algebraically valid (just spell out all of the terms and they cancel 
pairwise). This gives D; B¢ = ð; B$? + °° A? Be =0 as [D¢, D}] = cjki e” BE which 
is the non-Abelian analogue of Maxwell ’s equation V - B=0. Normally, in the 
purely Abelian theory, this equation denies the existence of magnetic monopoles. 
Here the magnetic monopoles do exist, since the non-Abelian divergence of the 
magnetic field contains inhomogeneous terms. The magnetic monopoles exist 
as instantons in the Euclideanized 2+1-dimensional theory or as actual static 
solitons in the 3+1-dimensional theory. g is the magnetic charge 


1 , 
g=— f dS' BEG" (9.23) 
ae 


and a is the vacuum expectation value of the scalar field. Clearly, if g is positive 
we take the plus sign in Equation (9.22), and if g is negative we take the minus 
sign. This implies that the Euclidean action has a positive definite lower bound 
in each topological sector. We will show g Æ 0 except in the topologically trivial 
sector. Indeed, for the ansatz 


= H (aer) — 
er 
_. 9) 
At = ehi = — K(aer)) (9.24) 
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finite action requires 


H(aer)—aer , ro 

K(aer) +0 , roo 

H(aer) < o(aer) , r=>0 

K(aer) < o(aer) , r—0. (9.25) 


Thus for large r 


ogi R 
AS we etti 5 = Ra, Rt + As (9.26) 
giving 
aka 
FS eign a (9.27) 


Defining the Abelian magnetic field as 


1 PE «et 
Di = 5 €ijk ee igs (9.28) 
we have 
1 4 
j= t f asıB; = = £0. (9.29) 


This is in fact the Dirac quantization condition on magnetic charge, gq = 27, for 
the minimal electric charge q = e/2. Not surprisingly, the theory knows that it 
can, in principle, have fields in the spinor representation of the iso-spin group 
(SO(8)) that do carry charge e/2. 

For the Higgs field satisfying the conditions of the “Higgs” vacuum 


pio Z az 
(Duo)* =0 (9.30) 


we can write the explicit solution, using the iso-vector notation $ for the Higgs 
field 


a 1 T AS 1 a 
At =- (6x 0,4) +=4°A, 
a 1 a 
Pav = 79 Fw, (9.31) 


where 


1 = \ a 
Fv = we (a6 x a6) + Op Av zj vAn: (9.32) 


A,, generates only a source-free magnetic field, but 


1 
Bi = 9 Sisk Tjk (9.33) 
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can have non-zero magnetic charge due to the first term in Equation (9.32). The 


magnetic charge in any region is 


2a $ oA r $ dS Eare€® $B; Pp. (9.34) 
ejz 2ea’ Js 

We will show that this integral is actually a topological invariant and equal to 

the result 47/e that we found for the configuration in Equation (9.29) above. It 

counts the winding number of the map from the surface © which is topologically 

S? into the S? defined by 6%¢% = a?. Indeed, consider a variation 6¢ which is of 

compact support, then e > È + 6b but since (6 . 3) =1 we get 


ò (8-8) 258 65 20, (9.35) 
Then 
5 (8: (0:8 Od) =F- (046 x Id) +8 (80x Od) +4: (jE x röð) 
= ô$- (jE x Ad) +0; (4: (65 x $) ) -8 (56 x Ae) 
E- (55 x jak) +O (4- (06 68) 


3g- (56 x ad) - re Q And x 50) 
=355- (0)8 x Od) +28; (4: (56 x I$) ) ; (9.36) 


where, in the last step, we use that the expression is contracted with «tt. The 
total derivative terms give no contribution to any integral since 6¢ is of compact 
support. Now 0;¢ and pġ are both orthogonal to ¢, thus a; x nd is parallel 
to @, giving 
ôg- (0,6 x a6) = (9.37) 

hence 

5 (b:(0)8 x &8)) = 20; (8: (58 x A9)) (9.38) 
Therefore the integral, Equation (9.34), is invariant under arbitrary continuous 
deformation of ¢, since these are built up from a sequence of infinitesimal 
deformations of compact support. A continuous deformation of the surface over 
which the field is defined can also be interpreted as a continuous deformation 
of the @ field, thus g is also invariant under continuous deformations of the 
integration surface (remember that we are only in the Higgs vacuum). 

Finally we can calculate g for 


p =at* =a—, (9.39) 


asymptotically, which corresponds to the winding number equal to one map. 
Then 


ipa g“ ggi a ai sani 
g =a Z =t oes (9.40) 
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which gives 

F eacop Ono? = aPeijkeate (57° — 478") (5° — 272°) = =e (9.41) 
Hence 3 

1 20” 5 1 Ar 
= — ¢ dit = — 8ra? = —. 9.42 
I= Jea? f re” T ea S e ea) 
This answer is robust, in that it does not change for any infinitesimal changes 
and hence for any continuous change in the Higgs field. If we use the winding 
number 2 map, the answer for the integral will be 2 x 4r /e, and so on. If we write 
¢= Rġo, then the winding number N map is obtained by taking ¢ = RN do. 

If we transform ¢% > ° = 68a, we cannot define the gauge transformation 
globally over any surface containing the core. We get the usual Dirac string 
singularity, 

1 (1—cos@) 


Aj = ail sind Pa; (9.43) 


where ¢ is the unit vector in the azimuthal direction. 


9.3 Monopole Ansatz with Maximal Symmetry 


The solution follows from the most general ansatz 


xe 
a— H ae 
O) (aer) ae 
7 xi r28 — rige P 
Aç = — 22a = K(aer)) + as Baer) + 32,3 C (aer), (9.44) 


which is symmetric under the diagonal subgroup of the group SO(3)rot. X 
SO(3)iso—rot. Of rotations and iso-rotations. If we had imposed invariance only 
under the SO(3)rot., the rotation subgroup alone, we would have to impose that 
$% is a constant on each spatial sphere, giving trivial asymptotic topology. On 
the other hand, the configuration that is invariant only under SO(3)jiso—rot., 
the iso-rotational group, has the only possibility ¢* = 0, which also has trivial 
topology. However, we can impose invariance under the next subgroup available, 
SO(3)diagonal, the diagonal subgroup of rotations and iso-rotations, which the 
fields in Equation (9.44) satisfy. 
Parity corresponds to the transformation 


Ps p (2! ,t) =} $°(—2!,t), AS (x,t) = —A%(—a!,t) (9.45) 
and there is also the discrete transformation 
Z: (x,t) > —$%(x,t), A‘ (x,t) > A‘ (x,t). (9.46) 


P and Z individually reverse the magnetic charge, thus we cannot impose 
invariance under each separately. However, their product leaves the magnetic 
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1 H(aer)/aer, K(aer) 


H(aer)/aer 


Figure 9.3. The curves of H(aer)/aer and K (aer) 


charge invariant. Hence, in the spirit of imposing the maximum symmetry on 
the solution without making it trivial, we impose that the ansatz be invariant 
under PZ. This implies B(aer) = C(aer) = 0. 


9.3.1 Monopole Equations 
We find, then, that H(aer) and K(aer) satisfy the system of equations 


Poa Kr) = Kr) (r) + K(r)(K(r) = 1) 
PG (r) =2K(r)?H(r) + HOH) ar), (9.47) 


They have numerical solutions as depicted in Figure 9.3. In the Prasad- 
Sommerfield limit [104], A — 0, we know the exact solution 


H(aer) = aer coth(aer) — 1 


aer 


K(aer) = (9.48) 


sinh(aer) ` 
This solution corresponds to the famous ’t Hooft—Polyakov magnetic monopole. 
In 3+1 dimensions it is a static, stable, finite-energy solution to the equations 
of motion. In 2+ 1 dimensions, but Euclideanized, it serves equally well as a 
finite-action, Euclidean space instanton, where it mediates tunnelling between 
different classical vacua, as we will see below. 


9.4 Non-Abelian Gauge Field Theories 


We must examine in some more detail what it means to have a quantum non- 
Abelian gauge theory. 
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9.4.1 Classical Non-Abelian Gauge Invariance 


First we will consider non-Abelian gauge invariance more generally, and then 
apply it to our specific case. A non-Abelian gauge theory admits fields which 
transform according to given representations of a non-Abelian group, 


o7U(g)o UG)EG, (9.49) 


where U(g)Ut(g) = Ui(g)U(g) = 1. If g does not depend on the spacetime 
point, we call the gauge transformation global, otherwise it is a local gauge 
transformation. However, the allowed variation of the gauge transformation is 
restricted to a region of compact support. It is easy to write a Lagrangian that 
is invariant under global gauge transformations, we simply construct it out of 
invariant polynomials of the fields. Spacetime derivatives commute with the 
gauge-transforming field U(g) and hence cause no problems. Now if we want 
to generalize the invariance to include local gauge transformations, we must 
introduce new fields. For our case 


=> (U(9)o)" 
(anor => 3" (U(g)9)" 
= (U(g)a"o)" + (("U(g)) 6)". (9.50) 


That is, if U(g) depends on the spacetime point, the derivative does not 
commute with it. We must introduce a new field, the gauge field Af, with an 
inhomogeneous transformation property which will exactly cancel the extia term 
generated by the derivative. We replace all derivatives by 


ð, +O, + Ap, (9.51) 


where A,, is a vector field with values in the Lie algebra of the representation 
under which ¢ transforms. In our case 


A, = A? (=°), (9.52) 
thus 
(D up)? = 0,67 — AP bac ge 
= =0,¢°+ ee Al ge. (9.53) 


A, is given the transformation property such that the covariant derivative 
transform covariantly: 


Did U(g)Dud. (9.54) 
This is satisfied if 
Ay > U(g) (Au + ,)U (9). (9.55) 
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Evidently 
Dp = (Ou + Ap) b> (Ou +U(Q) (Ay . .)U (g))U(g)e 
z (g 
(3, + Ay) re oe +U(g) (O,U"(g))U(g)) $ 


(On T ee Q 
g)Du o (9.56) 


The covariant derivative has the geometrical interpretation as the parallel 
transport in a fibre bundle with connection A,,. For each infinitesimal path, 
xc’ — a! +da", we introduce the gauge field A(x”) and an element of the group, 


g(x +dz, A) =1+dz" A, (9.57) 


Then for a finite path C we integrate this as 


g(C, A) =P (f | asta, b), (9.58) 


where the P symbol means the path-ordered integral. Intuitively this corresponds 
to the limit taken by multiplying the group elements of the form (9.57) for a 
finitely discretized approximation to the finite curve C, in the order corresponding 
to the direction of the curve, and taking the limit that the discretization becomes 
infinitely fine. The other definition, which yields the same result, is to expand the 
exponential and then perform the multiple integral at each order, after applying 
the path-ordering to the integrand. A field is considered to have been transported 
in parallel in the connection A,, if 


o (2+ da) =g 0 tAn) (x) =U (g(a + de, A,))d 
= ġ(x) + dx" A p(z). (9.59) 
Then, in general, 
le + dx) — g9et47 Au) (x) = dz” (8, + Ay(2)) AlE) 
= dz" D,,0(2) (9.60) 


defines the covariant derivative in the connection A,. Here A, = Ait", where 
t° are the generators of the group in the representation that ¢(a) transforms 
under. 


9.4.2 The Field Strength 


To construct the non-Abelian field strength we must consider a generalization of 
the Abelian version, 


Fav = p Av — Op Ap. (9.61) 
This is invariant under Abelian gauge transformations 


Ay —> Ay +ið A 
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(z,y+dy) 
(x+dz,y+dy) 
(x+dz,y) 
(x,y) 


Figure 9.4. An infinitesimal closed loop C 


SF yy = (0,0, —0,0,) A =0. (9.62) 


We can write this as 
Fay > e OP ye SF (9.63) 


For Abelian phases, F» is invariant, but if we generalize this formula to non- 
Abelian groups, F, does transform, but homogeneously. We construct F y via 
parallel transport. The same construction works as in the abelian case. Consider 
a closed loop C drawn in Figure 9.4, and 


we,2.A)= enf- far, 


=1- pata, + f dar dx} Ap (£2) Ar (z1)+---. (9.64) 
LQ>ry 

This group element transforms covariantly. Infinitesimally for each segment of 

the curve C, we find 


g(z+dax, A”) =1-— dx" A7, 
=1- dx"U(g) (Ay + 3p JU (g) 
=U(g) (1 — dz” (A, + (A,U'(g)) U(g))) Ut (9). 


(9.65) 

Now, 

U(g(a)) (1 — da (3 U (g(a))) U(g(a))) 
=U(g(x)) — da"U(g(x))d," (g(x) )U (g(a) 

=U(g(x)) + dx"0,U(g(x)) =U(g(x + dz)) (9.66) 

hence 

g(a + dx, A9) =U(g(a +dx)) (1 — dx" A,,)U" (g(z)) 

=U(g(a+dx))g(a + dz, A)U' (g(x). (9.67) 


Thus for the infinitesimal closed loop, as in Figure 9.4, starting and ending at x 


g(C, £, A9) =U (g(2))g(C, x, AU! (9(2)), (9.68) 
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which is exactly the covariant transformation property. Considering the second- 


order term in the expansion in Equation (9.64), we have for each straight line 
path part of the contour of direction l, 


1 1 
facta, = dtl” A (£ +t) z) dtl” (A (2) +17t05A,,(x”)) +00) 
0 0 
1 
=I" A (£) + gh U ĉo Aple”) Ae (9.69) 
Thus for the closed path we get to second-order contribution 


f dc" A„(2”) = { (aeran) + silat ded A,(2")) 


1 
+ (ay Ayla" +dz”) + sia! dy? ðs Anla) 


1 
+ (irana +dz” + dy”) + gia da” Oo Ay e) 


+ (avanta + dy") + dydy BoA (x”) 
Š { (dora, (2) + sila ded A,(2")) 
y (a [Ay (2) + dx? Oo Ay (2")] + Z dydy" Oo Anla”) 
+ (—de![A,(2”) + dv, A,,(a”) + dy? pA, (2”)] + site dea A,(2")) 


+ (dy gle") tay’ OoAy(e")] + 5dy'dy"2, Anla) } 
= dx? dy" (8¢A,,(2”) — Öp Ac (2”)). 
(9.70) 


Notice that this term contributes with a minus sign in Equation (9.64). When 
integrating along one side in Equation (9.64), the second-order term gives directly 


atdx T2 1 t 
| dat ‘| de!’ A, (04) A,,(0¥) = i dt (4, (2 +t) I dsl” A, (a +0") 
x Ba 0 
5 - of dsl” Ag( 2) 
1 


dt (“A,,(x”)tl? A, (2”)) 
0 


= ee )Ac(2”). (9.71) 


The two integrals simply factorize when the integrations are on two different 
segments and no factor of one half is generated. Hence adding up the 
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contributions around the loop, substituting for I” with dz” or dy" gives 


farf _ detiAutor) = i = dy“ Ayla” + dy") 
LQ>xL1 
1 
x (au? Ae" + dy”) — dx? Ag(a” + dx” + dy”) 
+dy° A,(a” + dx”) + dx A, (2”)| 


—da")Ag(x” + dx” + dy”) 


taal Aula”) | Sao Aol I 


1 1 
+ {Say Aula” dy" Aola” )— gat" Aula” dx? A(x") 


dat Aa \dy? Age) 
1 1 
+5 dy" Ayla" )dy? Ag (x") + dy" A,,(a")dx° A,(x”) 13d" Ay (0)? Aoa") 
= -dx dy" [As (x )A (£) — Ap(2”) Ag (2”)]. (9.72) 


Adding up the two contributions, Equations (9.72) and (9.70), simply gives 


Pexp{da" A,,(a”)} = —dx7 dy" (0, A u(x”) — Op Ao lT”) 
+ [Ao(2”), Ay(x”)]) + o(dx)? 
= —dx° dy" Fp, + o(dz)*, (9.73) 


which must transform covariantly. Actually we can write Fuy as the commutator 
of two covariant derivatives, 


Fup = Liab | = [Ou As Oy, +A] 
= [O., Av] + [An Or] + [Au Av] 
= ð, Ayo ðv Ap + [An A]. (9.74) 


Then, due to the algebraic structure of Fv, we immediately know that the Jacobi 
identity will be satisfied, 


[Dp [Dv,Do]] + [Do [Du Da] + [Dr [Do Da] =0 
> [Dus Fue] + [Do, Fav] + (Dv; Fou] =0, (9.75) 


which in four dimensions is exactly the Bianchi identity, 
Ope’ For + [Ay, "T For] =0. (9.76) 


Thus F,,, is the appropriate covariant generalization of the usual Abelian 
definition of the field strength. 
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9.5 Quantizing Gauge Field Configurations 


The physical (non-gauge) zero modes of the action come from translations of the 
positions of the monopoles and rotations of the monopoles in iso-space. This gives 
simply the volume of spacetime and the volume of the gauge group as a Jacobian 
factor. However, things are not so simple, since in a gauge theory there are lots 
of unphysical zero modes associated with gauge-equivalent configurations. The 
naive functional integral for a gauge theory is not well-defined, even in Euclidean 
space. 

The Lagrangian of a gauge theory is called a singular Lagrangian, the equations 
of motion do not give rise to a well-defined initial value problem for the gauge 
fields. Obviously, if we fix the initial data, and find a solution of the equations 
of motion, there actually exist an infinite number of solutions of the equations 
of motion that satisfy the initial conditions, which are simply gauge transforms 
of the original solutions. The gauge transformations, of course, must be time- 
dependent, so that they do nothing to the gauge fields on the initial hyper- 
surface, but they do modify the gauge fields afterwards. The freedom to do 
time-dependent gauge transformations allow for this, and the solution of the 
initial value problem is not unique. Thus fixing the gauge becomes essential to 
define even the classical dynamics. Correspondingly, the quantum dynamics also 
requires gauge fixing in order to be well-defined. The important point is that, 
because of the gauge invariance, the actual physical content of the theory does 
not depend on the choice of gauge fixing. 

The action is invariant under the infinite dimensional group of gauge 
transformations, G. Thus 


N l Dearden ERC) (x paoe) (977) 


gauge 
inequivalent 


as geometrically drawn in Figure 9.5. The volume G is, of course, infinite, it is 
not just a few zero modes which arise as in the propagator, but an infinity of zero 
modes due to arbitrary local gauge transformations. This infinite volume should 
cancel between numerator and denominator; however, we must realize how to 


define 


N D(A,d)e™ (9.78) 
gauge 
inequivalent 


properly, i.e. in a gauge-invariant manner. The method for defining this integral 
is to begin in a canonical gauge, where the quantization is understood and 
well-defined, and then transform to any other gauge in an invariant way. This 
procedure was first spelled out by Faddeev and Popov [44]. 
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A G 


A/G 


Figure 9.5. The space of all gauge fields, corresponding to the space A/G with 
leaves, foliated by the group of gauge transformation G 


9.5.1 The Faddeev—Popov Determinant 


We will start with the gauge choice 
A? =0. (9.79) 


This gauge condition is complete, which means that we may not make any further 
gauge transformations whose derivatives are of compact support. These are the 
so-called local gauge transformations, those that go sufficiently fast (often taken 
to be of compact support), to a constant at infinity. This constant is usually 
taken to be the identity. We insist on the gauge choice, that is, A? =0, then any 
gauge transformation must satisfy 


A? +g" *03g=0 = 03g =0. (9.80) 


But then g must be a global constant, everywhere equal to its value at 
infinity, chosen to be the identity. It is easy to convince ourselves that no local 
gauge transformation can be non-trivial and still be independent of x?. Hence 
Equation (9.79) is a complete gauge-fixing condition as far as the group of local 
gauge transformations is concerned. We define 


~SplA,4) 


T=N [D(A,¢)5(Aa)e S (9.81) 


For any other gauge choice such that F(A;) = 0 there must exist a gauge 
transformation go(A) such that 


(Aye =o, (9.82) 


since it is understood that the set of gauge orbits of a given gauge slice must 
span the entire space of gauge fields at least locally.! 
We define A(A) by 


1= A(A) | Dg (F(AN)), (9.83) 


1 The Gribov ambiguity maintains that this is not exactly true. There do exist multiple 
gauge field configurations that respect the same gauge condition. However, these 
configurations are typically a finite distance away from each other. Thus the configurations 
that satisfy the gauge-fixing condition and their gauge orbits certainly give a complete 
foliation of the local neighbourhood of the space of gauge fields. 
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where Dg corresponds to the integration measure for integration over the full 
group of local gauge transformations. This measure is defined in an invariant 
way, formally, the metric on the space of gauge transformations is defined as (in 
d dimensions) 


gs j detr ((978g)(g7+59)) . (9.84) 


Here dg corresponds to an element of the tangent space of the group of 
gauge transformations, this is called its Lie algebra. If h is an arbitrary fixed 
element of the group of gauge transformations, then the left multiplication 
by h in the group gives left multiplication of the algebra, 6(hg) = hôg and 
the 1-form g~‘dg is left-invariant, as is the metric Equation (9.84). The 
metric is actually also invariant under right multiplication, since d(gh) = 
(dg)h, but then tr((gh)~"d(gh)(gh)*d(gh)) = tr((h-*g *(6g)hh-*g" *(6g)h) = 
tr((g~1d9)(g~1g)). Dg is then formally the corresponding volume form. We 
will mostly need to integrate over an infinitesimal neighbourhood of the identity. 
Here, with g =1+ a, where a is an infinitesimal element of the Lie algebra, we 
have, since g~!dg = a to first order, and the analogue of the Euclidean geometry 
in the space of all a’s 


lal? = [aim (a°). (9.85) 


This then allows for the replacement Dg —> Da with free, linear integration 
over q. 
Notice that A(A) is gauge-invariant, for an arbitrary gauge transformation h, 


A(A") = A(A). (9.86) 


This is because the integration measure over the group of gauge transformations 
is expected to be and can be defined to be gauge-invariant, that is, 


soak ((AP)9 ))= [Do5 Pa 
= 2) g 
Í REL F(AS j= Dg 5(F(A’) 


-AA 


A(A) is called the Faddeev-Popov factor. (We call to your attention that (A?) = 
Ag h as the group action works by left iis an Then 


I= N f D(A) 5(Ag3)e ia A) | Dg6(F (an) ) 


=N [D9 | D(A, o) (A3)e— ar A(A)6 (F(A?)) 


(9.87) 


1 


=N / Dg I D(A,6)5(Ag ee 6 (F(A,)) 


=N fD A,¢)8(F(As))e aca) ( [Dost Ag” i (9.88) 
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Now let 
g7’ =g "g0(A) (9.89) 
such that 
(43) =0. (9.90) 


For a given g, g’~* will depend on A; however, the integration over all g’ will 
not, as the integration measure is invariant under left or right multiplication, as 


explained in the discussion after Equation (9.84). That is 


[Ps (45`) = f vga (cay (420)9 p- foss(o 3 (9.91) 


is a constant, independent of A, and so we can absorb it into the normalization. 
Thus we get 


=N" | DADEA AAT. (9.92) 


We see how to change the gauge from the choice A3 = 0 to an arbitrary gauge 
choice F(A;) =0, the integration measure must be appended with the Faddeev— 
Popov factor. The Faddeev—Popov factor, 


= i Dg 6(F(A%)) (9.93) 


will only get contributions from the infinitesimal neighbourhood of A around the 
point where F(A) = 0. Thus for A satisfying the gauge condition, we have, with 
g=1+a, where a is an infinitesimal element of the the Lie algebra, 


F(A!) = F(A) + [dys Dray) +o’), (9.94) 


since the change in the gauge field is exactly ôA; (y) = D;a(y) and the integration 
is over a with measure Dg — Da. Then generalizing the standard property of 
the integration over a delta function f d”xô(M -x)= (detM)~', we get 


a= [Des{ fu e ag) O) 
Se (- Dis ( / Pad(aly))) (9.95) 
The last factor is 1, thus 


A(A) =aet (—D ZE am): (9.96) 


This expression is usually re-expressed as a fermionic functional integral over 
the so-called Faddeev—Popov ghost fields, which formally gives the determinant; 
however, for our analysis, we will not require or implement this step. 
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9.6 Monopoles in the Functional Integral 


We want to calculate the functional integral 
TH SE(A,¢) 
(Ole 7 Jo) =N | D(4,9) a (9.97) 
We will calculate it in Gaussian approximation about the critical points 
of Sm(A,¢). This corresponds to integrating over the space of fields in 
the infinitesimal neighbourhood of the classical critical points, the monopole 
solutions. The usual understanding is that the contribution from the fields that 
are not in the infinitesimal neighbourhood of the monopole solutions will be 
suppressed by the exponential of the action. Knowing monopole solutions exist 
and are the critical points, we will get a result of the form 


a Spg(n monopoles) ie 2 
(Ole 0) =N SO eT et | (2) 


To make this expression quantitative, we must do three further calculations: 


A . (9.98) 


n=—Co 


1. Find the action for N instantons (nı monopoles and nz anti-monopoles with 
ny +ng=N). 

2. Identify and separate the zero modes in the spectrum of Gaussian fluctuations. 

3. Define the measure of functional integration to make the determinant in 
Equation (9.98) well-defined. 


9.6.1 The Classical Action 


As usual 
82s 
K eS Slee a: m A 
2 ra 2 2 
605 crit. ( ee ) ôdi vac. 


= K"-1, (9.99) 


where “crit.” stands for the critical point of n instantons, and “vac.” stands for 
the vacuum configuration. The last factor is equal to 1 which serves to define M 


Sz 
i (Se) 


The action for n widely separated instantons is n times that of one instanton. 


1. (9.100) 


vac. 


The number of such configurations behaves like 


ve 


n! 


~N 


(9.101) 


This “entropy” factor is, as usual, much larger than the corresponding factor when 
any subset of these n instantons are constrained to be close together, i.e. multi- 
monopole configurations. Even though the contribution of n widely separated 
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instantons is suppressed by the exponential of its action en" Sh, the “entropy” 
factor can be big for a large finite spacetime volume (V 8)”, until eventually the 
1/n! takes over as it will always eventually dominate. 

The action for a single monopole is defined by a function € (4): 


SE EW e (3) ; (9.102) 


mw ~a and the function e can, in general, only be calculated numerically; 
however, in the Prasad-~Sommerfield limit, e(0) = 47, S9, comes almost entirely 
from the integration over the core region 


À 1 
PaL? =e 2) (1+0/ —_)). ‘i 
= tL 2 ‘| +o ah (9.103) 


The correction to the action from fields outside the core behaves like ż, exactly 
the classical Coulomb self-energy of a magnetic charge. 
For n well-separated monopoles of charge Ande , in addition to the Coulomb self- 


energy of each monopole, there is also a Coulomb interaction energy, a correction 
that is additive 


dadb 
SE [Coulomb = 2e2 a [Za (9.104) 


— Zol? 


with qa = +1. Then 


2, da Wb 1 
Sg(n monopoles) = mH e(3) dM Yee 2 4 [Ea Sail o( 5) 


(9.105) 
where the small corrections exist because the monopoles are not point charges 


but spread out over regions of size = 1 R: The additional Coulomb interaction 


energy term is non-negligible and has profound consequences. 
9.6.2 Monopole Contribution: Zero Modes 
Now we are in a position to analyse the zero-mode spectrum. If we write 
Ai=A ta; ¢=¢"+¢9, (9.106) 


where a; and y are quantum fluctuations about the classical values, we have the 
expansion of the action to second order in the fluctuations, 


Szp=(Sz)at+(Sz)oat+-::. (9.107) 
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The first-order term vanishes because the equations of motion are satisfied for 
the classical fields, and (Sz) is given by 


1 1 1 
(Sz)2= pew E (D'a; — Dfa) tza (lai, aj] FS) + 3 lai, 01]? 
1 


+5 (DP) + SoH? (0°) p+ [Dhea] + DPS" lael] (9.108) 


with 
DP =ð; + [Ag . (9.109) 
This is a bilinear expression in a; and y, thus integration over these fields will 
1 
give det 2 (O), where the operator O is the hermitean, linear, second-order 


differential operator appearing between these fields in Equation (9.108). We 
expect O to have eigenfunctions as (although they generally will be a continuous 


set) 
o (ate) (4) =a (21), ono) 


We expect the eigenvalues to be positive or zero, since the classical solution about 
which we expand the action is a minimum of the action. It is important to see 
that for any n such that Q2 > 0 the corresponding eigenfunctions satisfy 


D§'a? + [6",6"] =0. (9.111) 


We will prove this from the hermiticity of the operator O, and the evident fact 
that 
a? = D'a(x), ¢°= [d"',a(z)] (9.112) 


is a zero mode of © for every choice of a(x). a? and ¢° are simply the changes 
induced by a gauge transformation, hence Sg (Ad +a?, p’ + 4°) = Sr (Aq, °), 
which is valid order by order. This implies 


0 


Ge = | ax(a?,0%0 =0. (9.113) 


Since O is hermitean, the modes for 0? > 0 are orthogonal to the zero modes 
hence 


0= pon (aza? + ¢"¢°) 
= [eee (a? D''a(x) +o” [¢",a(z)]) 
= J datr (8;(a% a(x) — (Dla? ale) + [4,6] a(x) 


= - f abate (Dfa; + [¢",6"]) a(x)) 
=> Da? + [¢",¢"] =0. (9.114) 
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The conclusion in the last equation is reached since the integral must vanish for 
any choice of a(x). The a(x) zero modes in Equation (9.112) are not physical 
zero modes, they arise from the gauge invariance. If we impose the gauge choice 


DPA; +[¢",¢] =0 (9.115) 


with the understanding that the classical fields are assumed to satisfy this gauge 
condition, we can show that the unphysical gauge zero modes simply do not 
exist. Indeed, the gauge condition implies 


0= DA; + [¢",6] = DE AF +a) + [6,67 + 9] 
= DAS! +4 lo", g] + D'a; + lo", g] 
= Dila,+[¢",¢]- (9.116) 


Then we see that the norm of the putative zero mode that satisfies the gauge 
condition Equation (9.115), that is Dfa? + [¢*', ¢°] = 0, simply vanishes: 


T datr (apa) +¢°¢°) = J dtr (Df'a(x) Dale) + ({",a(x)])?) 
=- f Pate ((DEDFa(e) + 6", [0%.a(@)]) ale) 
= - f Patr ((Dfla? + [67,¢°]) a(z)) =0. (9.117) 
This requires a? = ¢° = 0, that is, the pure gauge zero mode that satisfies the 


gauge condition simply does not exist. 
The Faddeev—Popov factor comes from the gauge-fixing condition 


F(A, ¢) = Df'(A;) + [67,4] =0. (9.118) 
Then following Equation (9.95) we have 


F(A, +) = DF (A; + Dfala)) + [67,64 [¢,e(2)]] 
= D'A: + DiDi ale) + [6,4] + [4",[¢,0(x)]] 
=D; A+ [6,4] + D7 Df a(a) + | 


Thus from Equation (9.96) 


A(A,¢) = det (Ds'D# T [o", [ġ, ) 
= det (D'DP! + [¢", [¢", ) (1+ 0(a;,¢)). (9.120) 


9.6.3 Defining the Integration Measure 


We can go further by defining the metric and integration measure on function 
space. We will integrate over an infinitesimal neighbourhood of the classical fields. 
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With 6A; =a; = A; — AS! and 6¢= y = ġ — ¢" to emphasize that we are in an 
infinitesimal neighbourhood of the classical fields, we can write the metric as 


- f dare ((6A,)" +(60)°) (9.121) 


The minus sign is to take into account the anti-hermitean generators of the Lie 
algebra of the gauge group taken in the definition of the gauge fields and scalar 
fields. This metric is gauge-invariant since the infinitesimal change in the fields 
transform homogeneously under gauge transformations, and hence the gauge 
transformation cancels out due to the cyclicity of the trace. We parametrize the 
space of all gauge fields as a sub-manifold which corresponds to those gauge fields 
that satisfy the gauge condition, which is called the gauge slice, and orthogonal 
directions which correspond to gauge transformations. These lead to those gauge 
fields that do not satisfy the gauge condition but lie along the gauge orbit of the 
gauge fields on the gauge slice. We can expand the variations 6A; and 6¢ in terms 
of an arbitrary, linear combination of the eigenmodes of the operator O, which 
respect the gauge condition, plus an arbitrary linearized gauge transformation. 
The eigenmodes translate us along the gauge slice while an arbitrary deformation 
off the gauge slice corresponds to a gauge transformation. Hence expanding to 
first order in €” and a(x) gives 


Ay = AS +) Ea; + Dala) 
o=e 4S erg | c! a(x)| (9.122) 
hence l 
(61)? = YE) - | Pate ((D#a(2))* + [6",0(2)]’) 
z -y (E)? f Pate (ale) (=DFDF — 6°] [6", Jale) 
= >> En)? -f datr (alx) (DEDS — [6", [6", )a(x)). (9.123) 


Thus the measure is given by 


(4i 6) = [] PAPA) = [Jas [] dato) jdet? (DED? + [4", [6% )) 
= [Jae Pa( (x)det 2 ((DS'D* + [¢", [¢%, )) 
í (9.124) 


using a direct generalization of the corresponding volume measure for a finite 
dimensional system, if ds? = }°,, gi;dx'dx! then the volume measure is dV = 
d"x,/g, where g = det[g:j]. Then the integration giving rise to the Euclidean 
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generating functional Equation (9.92) is given by 


T=N" | DADEA G) AA oe 
=N" / [din Pala)det? (— (DP DF + [6", [¢", )) 5 (F(Ai,o)) A(A, pe 
" (9.125) 
But 4(F(A;,¢)) = 5(D§'(Aj)+[¢",¢]) and then using the expansion 
Equation (9.122) gives 
J Da(x)6 (F(A;,¢)) = / Da(x)é (— (DED? + [6", [¢"',) a(z)) 
= J Da(«) det! (— (DDS + [4", [¢", )) ((a(2))) 
= det~' (— (DFD? + [6", [¢", )). (9.126) 


We notice that this factor will actually neatly cancel out the Faddeev-Popov 
determinant. Indeed, we get 


det? (DEDE + [4% [¢", ) 
det (Do! De! + [p [¢", ) 
~ [J déndet? (DIDF + [4% [¢", ), (9.127) 


n 


D(Ai,¢)A(Ai, p) = |] déndet (DFDE + [¢",[¢, ) 


where in the first line we have retained the full Faddeev—Popov factor multiplied 
by the factor coming from the measure and the integration over the gauge-fixing 
delta function. 

There are still the physical zero modes corresponding to translation and 
internal rotational symmetries. The rotations give a finite constant volume factor 
which eventually cancels. Naively these are for translations 


ao — N~2a,A% 
ph) = N73 (9.128) 


however, these expressions do not satisfy the gauge condition. Augmenting by a 
gauge transformation gives (with a* = — A?) 


af? = N72 (DkAF — D'AR) = N72 Fẹ 
p(k) = Nes (arp + [AF], o"]) = N72 Dag (9.129) 


with N =— fdxtr (as + (Dg) °). The gauge condition 
DF Fri + [67 Ded] =0 (9.130) 
is just the equation of motion. Under a translation 


5A; = AA DIGRA SR Fei = N26R,a™ 9.131 
Jorg 4 


9.7 Coulomb Gas and Debye Screening 183 


thus ; 
dé = N2dR, (9.132) 
and i 
Bek — NBR. (9.133) 
So finally the integration measure is 
3 = 
D(Ai,d)A=N2@R] J déndet? (DPD? + [o", [4° ). (9.134) 
n#0 
For one monopole we have 
Soe Sei ol Q? (Sp) 
T E 
n#0 n 
7 
2 A.) ™m > 
= [7a (3) ee) eR (9.135) 


from dimensional analysis and a is a function that can, in principle, be calculated. 
For N (not to be confused with the normalization above) instantons, nı 


monopoles and nz anti-monopoles, 


Gs 7 3D a Ze? Lar RR | 
ZN= WI [4 Rj y G G b (9.136) 
i j=l qa=+1 
and 
Gy N = T > dadb 
Be N f [ie aeae, (9.137) 
N,qa=+1 ` j=1 
where : 
2 A = x my 
c= "ha(4)e Va) ae (9.138) 
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This is exactly the partition function of a Coulomb gas. We know that such a 
gas has the property of screening. This is the same as confinement. Any electric 
fields will be cancelled exactly by a complete rearrangement of the particles in 


the gas. 
If we re-express Z as a functional integral 
re? 3 2 (N N > ; B 
Z= | Dye FI a Se, Sg | TL, (9.139) 
N,qa=+1 `” j=l 


Indeed, 


[Dee e T _ 
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= [ Dxe me? fPa( XV? x+i Da 4ad(F—Ra)x(@)) 


= [oye Ë LoCo ta Batet- (a)? taia (Ha) Dale) 


-28 (Bog Za aadlt Ra)(—z)( V?) (=) Xo wE- Ry) 


R? (ze?) 
L Ce ree S Pt Za tadl- Ra) (Se ) Zo wi- Rp) 

=C Re SPLE a dadl- Ra) eae ye qb 9 (T- Ry) 

=E zz sta Sa? £) a Ia9(F— Ba a) Eo d Fag Re 

7 nee ee 

Se me anor a b,aAb dadb TT -Rol (9.140) 


where we absorb a harmless divergence at a = b into the constant.? Thus (using 
e— e/2r in Equation (9.139)) we have 


ET 
-È J Bevo ye 3 =)\\~ 
= [ Dxe Š 3 fe 2cos(y(R)) 


= [AE E SS eae) 


= [Dye S @x((Vx)?—M? cos(x(x))) (9.141) 


with M? = 18. 
There are no massless modes. The coupling constant, nominally taken as ¢, 


m 


satisfies ¢ x Pa ees) << 1l as e —> 0. This means that there are no massless 
gauge bosons, the low-energy Abelian theory is confined due to the effects of 
instantons. This is an incredible result; the theory is confining. Unfortunately, 
the result will not go over to four dimensions. However, in three dimensions, 
where the general arguments concerning the flux subtended by a large Wilson 
loop are critical, we find that the theory nevertheless favours confinement. 


2 We have a slight discrepancy with respect to Polyakov’s paper [103]. We find that in 
Equation (9.139) we should replace e + e/27. This does not change the behaviour of the 
theory. We implement the change from now on. 
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Monopole Pair Production 


In this chapter we will study the analysis by Affleck and Manton [4] of the 
decay of constant, external magnetic fields due to the production of magnetic 
monopole—anti-monopole pairs. The calculation is analogous to a calculation of 
the decay of external electric fields by Schwinger [109] due to the production 
of electron—positron pairs. In both cases the effect is due to non-perturbative 
tunnelling transitions. 


10.1 ’t Hooft—Polyakov Magnetic Monopoles 


In Chapter 9, we saw the solutions that correspond to magnetic monopoles, 
in the Georgi-Glashow model [54]; however, as we were in 2+1 dimensions 
these solutions were instantons in Euclidean three dimensions. Clearly the same 
solutions in 3+1 dimensions correspond to static soliton solutions and correspond 
to particle states of the 3+-1-dimensional theory. There is a perturbative spectrum 
of particles corresponding to quantization of the small oscillations about the 
trivial vacuum. These particles correspond to a massless photon, a charged 
massive vector boson, and a neutral scalar from the Higgs field. We will consider 
the limit that the Higgs field mass and the vector gauge boson masses are very 
heavy while the photon remains massless. In this limit the monopoles are heavy, 
essentially point particles. We will see that in the presence of a constant external 
magnetic field, the Euclidean equations of motion admit instanton solutions 
that describe the production of monopole—anti-monopole pairs. The form of the 
instanton is surprisingly simple. 


10.2 The Euclidean Equations of Motion 


The solutions to the Euclidean equations of motion for a ’t Hooft—Polyakov 
magnetic monopole in a constant external magnetic field must exist in general, 
as the initial value problem for the corresponding set of non-linear differential 
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equations is well-defined. The solutions must be well-approximated by the 
solutions to the equations for point-like monopoles, certainly in the limit that the 
masses of the Higgs field and the massive vector bosons are taken to be very large. 
Then, apart from the self-action of each monopole being very large, the additional 
contribution to the action from the Euclidean trajectories of the monopoles will 
not diverge. The state of the system in the presence of a constant magnetic 
field should correspond to a meta-stable state, similar in principle to a false 
vacuum. There will be a finite probability for the creation of a monopole—anti- 
monopole pair. Creation of the pair of course costs energy; however, separating 
the monopoles in an external magnetic field gives back energy. After a separation 
to a critical radius, it is energetically favourable for the monopoles to separate 
to infinity. Thus the analogy to the decay of a meta-stable state is quite apt. 
The result is an exact analogy to the Schwinger calculation [109] of the decay of 
a constant electric field due to the creation of charged boson—anti-boson pairs. 
Schwinger found the amplitude 


2 p2 co ao] n+1 —nnm?/eB 
Š cy — (1+ 0(e2)), (10.1) 


T — 
873 aa n 


where EF is the amplitude of the external electric field and m is the boson mass. 
Manton and Affleck [4] found the result 


22 3 
r= A (1+0( +o) (10.2) 
with g the magnetic charge, B the amplitude of the magnetic field, and M the 
mass of the monopole, which corresponds to the first term in the expansion found 
by Schwinger, interchanging electric charge and field with magnetic charge and 
field. 

To find this amplitude, we will look for a solution to the classical Euclidean 
equations of motion that interpolate between the false vacuum in the presence 
of the constant background magnetic field, and the configuration containing a 
monopole-anti-monopole pair which are separating to infinity in the background 
magnetic field. The Euclidean solution will actually be a bounce-type instanton, 
thus we expect the pair will move apart up to a critical separation and then 
bounce back and return to each other and annihilate. The bounce point will 
correspond to the point at which the tunnelling occurs in Minkowski spacetime, 
and after the appearance of the physical monopoles in Minkowski spacetime, 
the magnetic field will pull them apart to infinite separation. The bounce should 
have one negative mode and all the rest positive. The negative mode will give rise 
to the imaginary part of the functional integral, with the appropriate analytical 
continuation. Effectively, the imaginary part of the functional integral is given by 


3m (TV Ke~*2), (10.3) 
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where 


-1/2{ Sp 


K= 


- (10 4) 
det—1/2 (5p ) 
0 


56: 
There are also some zero modes that give the usual complications, which we 


will deal with using the Faddeev-Popov method. Our conventions will be the 
following for an SU(2) gauge field A, = A/T“ and a scalar field in the triplet 
representation, ¢ = ¢°T, where Ty, = 7° are the anti-symmetric 3 x 3 matrix 
representation of SU (2), 


1/1 AÀ 
A Oa C (108) 


where [T*,T?] = èT", |¢|? = $%6%, Fury = 3 Ar — OvAy — [A n, Av] and Dyo = 
ô p — [4,9], and Mw provides the mass scale. The equations of motion are 


Dy Fu > [Di ¢, 4] 
D,D, = 2 (l4? — Mẹ) Q. (10.6) 


If we take A, = 0 and all fields independent of x4, the equations of motion reduce 
to the static, Euclidean three-dimensional equations that we have already studied 
in Chapter 9, and there is a finite energy, stable, static non-trivial solution of the 
equations corresponding to the magnetic monopole. The action of the monopole 
is, of course, not finite as the solution is independent of x4. The mass is 


M= rk 


k(A/e?) where k= 1 for \/e? <1 (10.7) 


and the magnetic charge is g = 47/e, the core radius is rą = g?/M and the 
“Abelian” field strength can be defined as fuv = Fi," /eMw. The Abelian field 
strength satisfies the Maxwell equation if |¢|? = M2, and D,,¢ =0. In the limit 
of A > 00, e? + 00 but A/e? remaining finite, the monopole core size goes to zero 
and it looks very much like a point monopole. 


10.3 The Point Monopole Approximation 


Then in an external, constant magnetic field, the monopole solution cannot 
remain static. In Euclidean time, it must respect the Euclideanized magnetic 


“Lorentz” force law 
dz ~ dzy 
M T = gfw Ts” (10.8) 


where z, is the position of the magnetic charge, s is a world line parameter 


normalized so that 44 4% = 


Gt Gt = land fe = T This equation is simply the 
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dual of the usual Euclidean “Lorentz” force for a charged particle in electric and 


magnetic fields 
zy dzy 
ds efu ds 


For the magnetic field with constant magnitude B in the three-direction, f12 = B 


(10.9) 


which means f34 = B. Then a solution of the equation of motion (10.8) is simply 


M B M B 
z3 = — cos (S75) z= — sin (Ss) l (10.10) 


The solution is obviously a circle. This is the analytic continuation of the 


zı = z2 = 0 and 


corresponding Minkowski space solution, which would be a hyperbola. 


10.4 The Euclidean Action 


The point monopole equations of motion are, of course, approximative, but we 
can derive them in the limit of a weak external magnetic field [4]. This circular 
Euclidean solution is exactly the bounce solution that we are looking for. We 
can equally well think of the solution in the (£x3,x4) plane as the creation of 
a monopole-anti-monopole pair, the two separating to a finite critical distance 
and then bouncing back together and annihilating. The diameter of the circle is 
the critical separation and corresponds to the point to which the pair separates 
in the Euclidean solution, but also the separation at which the pair appears in 
the tunnelling process, in Minkowski space. The circular solution neglects the 
Coulomb attraction between the monopole-anti-monopole pair. We will see that 
the Coulomb interaction does not greatly affect the instanton. To analyse the 
corrections, we consider the following decomposition of the action 


E E ext. pex 
SE = pes (<- aee) + [ates Gaal = oe r) l (10.11) 
where we have separated the Lagrangian into the first term that governs the 
dynamics above the Abelian gauge field and subtracted the action of the 


external gauge field. We define the dual Abelian gauge field into the core of 
the monopole as 


pfu a Jy 
Ont av =0 (10.12) 


where Ju is an appropriate, conserved, Abelian definition of the dual current into 
the core. Outside the core, jp = 0 and the source-free Maxwell equations are 
perfectly valid. Equation (10.12) are just the Euclidean, dual, Abelian Maxwell 
equations with magnetic sources. As these are just the dual Maxwell equations, 
there exists a gauge potential à, such that 


fu = pð — Ovðp- (10.13) 
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Ly 


Figure 10.1. Circularly symmetric monopole—anti-monopole instanton 


Exploiting the circular symmetry of the point-like solution we write 


å, = (0,0, — sin 6, cos6)w(x, y.z) 
ju = (0,0, — sin 8, cos) p(a,y.z), (10.14) 


where x,y are the normal cartesian coordinates, but z,@ are polar coordinates in 
the z3,x4 plane, with the radius shifted so that z = 0 corresponds to the radius 
of the circular point-like monopole instanton, i.e. the usual radial coordinate 
is r = z + R, as shown in Figure 10.1. Thus z = —R is the origin, and we will 
expand the action about z = 0. Then for the first term of the decomposition in 
Equation (10.11) we write 


Sl, =2n J va RA E iat, (10.15) 
where L Abelian = ne which can be evaluated from Equation (10.14) 


1 1 
L Abelian = (F(a + pyt + (10.16) 


1 
2(R+2)}? w) , 
where the index i goes over x,y,z and £ is of course the full Lagrange density 
given in Equation (10.5). Away from z = 0, we expect that the solution is 
exponentially zero, Dj¢ ~ V(¢) ~ e~Mw!#l and Per’ > fis fij, exponentially 
fast, and consequently (L — L Abelian) also vanishes exponentially. 

We make no great error by changing the range of z from —R < z < oo to 


—oo < z < œ, as long as all fields and densities are exponentially small away 
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from z =0, thus we get 

h= ack | da (L — L Abelian) + a (L—LaAbvetian) ; (10.17) 
where now the integral is over an entire three-dimensional Euclidean space. We 


expect that we can perform an expansion in powers of 1/R. The Maxwell equation 
for the Abelian fields is 


1 1 
iði 4 Ris man” (10.18) 
then if 
V@ =Y hD (10.19) 
n=0 


the density p(#) must also admit a similar expansion 


= DAG (10.20) 


as well as the Lagrange density £. The terms in the expansion must be of 
alternating parity as z > —z. The second term in Equation (10.17) vanishes 
to lowest order. The limit, as R —> œ, i.e. B — 0, which is the order n = 0 
term, the solution is simply a static monopole at rest, the circle has infinite 
radius and thus becomes effectively a straight, world line. Then the first term of 
Equation (10.17) just gives 


Si = 207R(M — Mapetian); (10.21) 


where M is the mass of the monopole and M Abelian is just the contribution to 
the Coulomb energy from the zeroth order part of the current density po(Z), 
while the second term must give vanishing contribution due to parity. Thus, due 
to parity, the next correction only comes at o (1 /R?). 


10.5 The Coulomb Energy 


The second term in the action, Equation (10.11), contains simply the energy in 
the Euclidean Abelian gauge fields, faw = = fee ae fert. where fie comes from 


the monopole loop, and fee comes from the fields oiii of the loop. Then 


EE 00 yin 
S? = i (ay _ ie pj: = = floor floop ping fii = SÈ loop hs S3 t 
(10.22) 
We will find 


RP= | dta herjer = | dadia Jlo jale) (10.23) 


872 jx — x"? 
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This can be shown by first observing that in the gauge 0,4, =0, we can solve the 


dual Maxwell field equation (10.12) for the dual gauge field simply as 4, = Aj, 


where the Green’s function is 


1 1 1 
= (10.24) 


An? |x — r'|? 


and the dual field strength is as usual 


~ J= Le 
fuv = p= jv — OV = ju- (10.25) 


Then it is straightforward to evaluate the contribution to the action 


sesi fate anii) (Odie) - (mai) (Ai) 
=5 fte- (25) Go) + (Si) (AAI). 0020 


The second term in the first line vanishes after integration by parts, the second 


term in the last line vanishes since ô, commutes with 1/0 and 0, J, =0 by 


current conservation, which is necessary for the consistency of the dual Maxwell 
equations and is assumed to be verified by the current. Then 


ioni N 1 jee) 


as desired. To calculate it explicitly is not too difficult. First of all, j,,(2)j,,(a’) = 
(sin@sin&’ + cos@cos6") p(x) p(x’) = cos(@ — 6’)p(x)p(x’), thus we get, writing 
d?x =dx,drq and dx’ = dxi dah 
g2 — aa eze A (; rr’ cos(6 — 6") p(a) p(x’ )drd6dr’ de" ) 
(a1 — 11)? + (a1 — x1)? +r? +r’? — 2rr’ cos(0 — 6’) 
(10.28) 
The integral over 0 and 6’ can be done explicitly, we leave the reader to work it 


out or find it in tables, giving 


1 W 
sy = fd @adrdr'— 4 (=== -1) 10.29 
f vd xdrdr! = ple)o(a!) ( (10.29) 
where, writing (x1 — 71)? + (£1 — 141) + (2-2)? =|%-2"|? 
y LE (ws = ah) tr 
7 2rr’ 
o (#1 — 24)? + (z1 -21 + (R42)? + (R42)? 
7 2(R+z)(R+z’) 
z-z? |#—2#"/?(z4+2z’) 1 
=14+ 55 om +0( 5 (10.30) 


and intriguingly the terms 1/R exactly cancel. Then expanding carefully 


W R z+ |z|,z, 2 
l= 1 10.31 
wear pea tae te E oe 
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and we note that actually the numerator only contributes to the the terms that 
have been neglected. Then in the evaluation of the contribution of this term to 
the action, the third term in Equation (10.31) vanishes because of parity when 
the lowest-order, spherically symmetric monopole charge density is put in for p, 
and the net remaining is simply 


1 R l 
2,loop 2 2 1 1 
Sr = fa xd xdrdr 7P0(2) po (x ) iS ] = 1) +o (=) f (10.32) 


The first term is exactly the Coulomb energy in the magnetic field while the 
second is proportion to the magnetic charge squared, 


1 1 
SZOP — on RM Abelian — if +o( a) (10.33) 


where g is the magnetic charge. The first term exactly cancels against the 
identical term found in SŁ, which is expected, since it arises solely because 
of the somewhat artificial Abelian magnetic charge density that was invented to 
extend the Abelian integration into the core. No physical phenomenon should 
depend on it. Thus 


1 
SL + S2loop — -739° + 20RM. (10.34) 


The interaction part of S}, which we will call Sa m is, integrating by parts 
and using the equation of motion, 


> 1 ~- 
spt = fates ieee fy far =- | dajac à (10.35) 


The external gauge potential can be taken with circular symmetry as 


1 
ag" = (0,0,-5B(R + z)sinð, 5 B(R+ 2) cos6) (10.36) 


and the current is 
ju = (0,0, — sin 8, cos 4) p(Z). (10.37) 


Then using dfx = dxidrgdrdér = dxıdzədzdð(R + z) = d3a(R + z)d0 and 
integrating over 0 gives a factor of 27 so that we get 

sor = - f dan B( (R+2z)?o(@) =—gn BR? + (10.38) 
Thus the total action is 


1 1 
Sg =27M — gBTR — 79° +o) (10.39) 


We vary the action with respect to R and demand that it be stationary to find 
the radius of the loop, 


0= =2nM — 2grBR, (10.40) 
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which gives R = M/gB. This is exactly the same value as in the case of the 
point-like monopoles, therefore we see that the inclusion of the Coulomb energy 
does not affect the radius of the loop. Inserting the value of R back into the 
action yields 


=9?, (10.41) 


and we observe that the Coulomb energy is ~ 1/R integrated over a circle of 
circumference 27R, which yields g?/4 which is independent of R. Finally, if we 
take the second variation we find 
2 

vee = —2gBr <0, (10.42) 
which means that the action has at least one negative mode and hence is at a 
saddle point. The negative mode is expected and gives rise to the decay width 
of the magnetic field. 


10.6 The Fluctuation Determinant 


We must now take into account the Gaussian integration over the fluctuations 
around the instanton 


K= wa (10.43) 


The factor of one-half occurs since we integrate over only half the Gaussian 
peak for the negative mode and any Faddeev—Popov factors are assumed to be 
included in the determinant. We have put the numerator in absolute value signs 
so that the negative mode does not give an imaginary value when we take the 
square root, as we explicitly put the 2 in by hand, in that the energy obtains an 
imaginary part E = E + il with 


DT =VKe-%2/"(1 + o(h)). (10.44) 


We must separate the zero modes, there are five, coming from four translations 
and one from internal rotation. The translation modes will give a familiar factor 


)) 


The internal rotation actually corresponds to the dyonic degree of freedom, 


of the square root of the normalization 


—1/2 


ı | Sp 


(10.45) 


4 1/2 
1 N 
K H 
i a l (45) 


internal rotation at a given angular frequency gives rise to a magnetically and 
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electrically charged state, called the dyon. The full rate of pair production and 
consequent decay of the magnetic field must include the production of pairs of 
dyons. But for the lowest order, we can restrict ourselves to the case of a simple 
monopole pair production. The internal rotation is intimately connected with 
gauge fixing and the Faddev—Popov factor. 

The translation zero modes naively are not gauge-invariant and must be made 
so by an accompanying gauge transformation, we find 


(6A), =0) A, Di Apes 
(60), = pọ- [A v9] = Do (10.46) 


and the normalization is (no sum on v, sum on a assumed) 


N,= f de (Erari pason], (10.47) 


The calculation of the determinant is possible in the limit R —> co (B > 0). 
In this limit, the fluctuations separate into those that change the shape of the 
monopole and those that change the shape of the loop. 
Using the circular symmetry and the gauge Ag = 0, we have 
SE = 07S 5 1 8? 
SP Vins. OO r? 002” 


where the first term depends on 2,y,r and is essentially a three-dimensional 


(10.48) 


3,inst. 


operator, while the second term comes from the kinetic energy, for example, 


1 1 8? 
Dy Dy = D1Dı + D2D2+ D,D, +- D, + -35ga (10.49) 
Eigenfunctions admit a separation of variables as 
cos(né =0,1,2,--- 
W(x1,22,7,0) = Y(£1, 22,7) pee (10.50) 
sin(n@) n=1,2,3,--- 
and then in the sector of angular momentum n we get 
ô SE ô SE n? 
= Fi 10.51 
Oo; inst. Ob} 3,inst. r? l ) 
Now we make an expansion in 1/R, with z =r — R, then, for example, 
1 
(DaDa) = Ds, De, + Deg Dra + Dz2Dz + Rp” 
yeh ye e? (10.52) 
ia au a R R2 a . 


To lowest order (1/R)° we just get the operator corresponding to the second 
variation of the Hamiltonian with a static monopole at #=0 


SE ôH 1 
ae Te =H 10. 
õp; 3,inst. õp; 3,mono. we (=) ( j Pa 
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The angular momentum term also admits an expansion 


2 2 
zs ete (1 Zt) (10.54) 


r2 R2 


so that to lowest order we have 


CH 1 n? 13 x ania 
& -o( 5) iato, J= y (10.55) 
i 13 mono. 


and we note that the angular momentum term is a constant. The eigenvalues are 


then simply 
2 


i n 
`l = wi + RD’ (10.56) 
where w? are the eigenvalues of EH . The A™ admit an expansion in 
i '3,mono. 
1/R as 
2 2 
(mo 2 n a | ON tci 
Ai = Wi T R2 T R2 + R4 + Ba (10.57) 


where the odd powers vanish as the order zero eigenfunctions have definite parity 
under z + —z. The correction a; is difficult to compute, but it is expected to give 
a small correction for the non-zero eigenmodes. To calculate them in principle, we 
must find the correction to the instanton to order o(1/R*) and then compute the 
correction to the eigenvalues to second order in perturbation theory. However, 
for the zero modes the correction is important, but easily calculable. 

There are three translational zero modes; first, consider the modes for 
translation in the x; and x2 directions. These are out of the plane of the loop 
and correspond to we =0 and ie =0. For these n = 0 and 0) =0= 2, 
Thus for these to remain zero modes to order 1/R we must have az, = Gz, = 0. 
For translation in the z direction, we see these are translational zero modes 
of the monopole in the plane of the loop. These must come with multiplicity 
two as there are two independent directions for the translation. Furthermore, 
they must deform the loop, hence they must correspond to n Æ 0. Indeed, the 
first deformation of the loop occurs for n = 1 and the two independent angular 
eigenmodes give the two independent directions of the deformation. Thus we 
require that a9 = a9 z= ag = 0. For the zero-order Hamiltonian, we already 
have w2, =0 and w2, =0, hence to order 1/R we must have 


az 1 az 


=T 


> a,=-l. (10.58) 


We perform exactly the same separation of variables and analysis for the 


z e(z) (10.59) 


denominator in Equation (10.45) 


8? Sp 
Sp; 


ô? H? 
30 SO; 
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which gives 


eee, (10.60) 


The determinant corresponds to the product of the eigenvalues, thus the angular 
momentum family corresponding to eigenmode i contributes as 


1 (0) (0) ~ (n) (n) 
mK: =-7 (ma — nr} oe (Ini = Inv) ) ; (10.61) 


where the factor of 2 is because all the n #0 modes come with multiplicity 
two while the mode n = 0 is solitary. To perform the summation we use the 
Euler—Maclaurin formula [2] 


N N 
0) +2 f(n)=2 (/ dsf) +f(N)+B( F(N) — f (0)) 


— Bal f"(N)— FO) += (10.62) 
where the Bi s are the Bernoulli numbers and f(n) = In(w? + ue Se oe 
In(w o + 2 2 + Fr +-+). For large n, we expect that Nr) > NO, hence 
F(N), F(N), fF” (N),--- all vanish. Also since Mr) is actually a function of n? 
the odd derivatives vanish at n = 0, and only the first term contributes, giving 
(letting Ry = x) 


S Qi Qi, 
nk, =—R | ue + y? 4 ca In(w o Hy? 4 = | -)) 
=-R | dy | 1 | E 
= y n Ww? a TO R2 
1 
=—Rr(w? —w79) +0 (2) (10.63) 


This follows from using the integral, 


N/R y N/R 
= Rf dyln(w? +y?) Ryln(w? +y?) — 2Ry + 2Rwarctan + 
0 W 10 


N? N 
= 2 
=Nlin (a R2 ) 2N +2Rwarctan (+) 


N? 
=Nin (e +i) — 2N + Rnw (10.64) 


taking N —> oo. This approximation is fine for all the angular momentum families 
that do not have exact zero modes. For n = 0,1 we would get a vanishing result 
and singularities in the amplitude. 
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We can, of course, still apply the method to the comparison theory of the true 
vacuum without the monopole. Here we get 


= 5 nA 3+ oh a imag (o wig tas a) 


N/R 7 5 N2 1 
=R f ade + Fz) +o( 5) 


= Rrwio+(2N 4 ym ($) Nto 5); (10.65) 


This follows from the integral Equation (10.64) after adding $ In (u? ot x) and 
expanding for large N. 


For the three zero modes, the sum over a9 for a = %1,2%2,z is done explicitly 


excluding AS, 4%? and AP (with multiplicity two). We will use the Stirling 


approximation nN! x NinN-—N+4 5n(27N). For a = 21,22 we get, noting 


w2 =0 
N N 2 
1 
-N` ha =- N] s ze 
daar D (Ge) +a 
N 
1 N! 
n=1 
=-2 (wmv w+ TED +2NInR 
=-2N1 N +2N —In(27N) + z (10.66) 
= n| R n(27 ol}: i 


Then subtracting the true vacuum result, Equation (10.65), from the result in 
the presence of the instanton, Equation (10.66), we get 


Inky, =— Rr (wr; —wr.0)—ln(2rR) and Inky, =-—RT(wr, —wgz.0)—ln(27R) 


(10.67) 
keeping in mind that w2, =w?, =0. For a = z we have a = 0, thus we must 
perform the sum 

1 N 
aes (0) _ (n) 
5 PAPI A mag, (10.68) 


n=2 


where we have put absolute value signs around a9 as it is negative (and the 
i is taken out explicitly in the Equations (10.43) and (10.44)). As w2 = 0 and 
= —1, we get A = —1/R. Furthermore, putting \S") = (n? — 1)/R, we get 


-5u (a =) i (10.69) 


1 N 
ee! (0) __ (n) sue? 
5 in|Az | D = 
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We evaluate the sum as follows 


E PA OE E 


we z+in]] (1-5) -2mm f] (MEY) 
=2InN!—2(N DmR+Y (m (224) n(-=)) 


N 
=2NIn (5) — 2N +ln2rN +2ln R- ln(2) (10.70) 


as the final sum is telescopic and gives the — In 2. Adding the —} In (Gz) =InR 
gives 


N 
1 N 
pope (0)) _ (n) _ = i i 
z inlA: | Z N Ink (2vm (5) 2N +ln2rN +2ln R m(2) 


=—2NIn (3) +2N —In(xNR). 
(10.71) 


Then subtracting the vacuum result, Equation (10.65), we get 


In K, = -Rr (w; —wz,9)—InnR?, (10.72) 


where of course w, = 0. Thus finally adding all the three contributions together 
we get 


X hk; =—Rr(wi—wio)—2ln2rR-lnrR? = Rr(wi—wio)—ln4r? R4 (10.73) 


or equally well 
1 


~ ARI 


The sum $ X; (wi —wi,0) has a perfect physical interpretation as the renormalized 


eT Bali (wi- 43,0) | (10.74) 


energy of the magnetic monopole due to vacuum fluctuations about the monopole 
configuration. This energy is properly subtracted with the energy of the vacuum 
fluctuations about the true vacuum. Thus we write 


= sD — wio) = AM. (10.75) 


The Faddeev-Popov factors, which we have not explicitly dealt with, will 
also contribute; however, their contribution also simply contributes to the 
renormalization of the mass of the monopole. 
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10.7 The Final Amplitude for Decay 


The final thing we must calculate are the normalization factors of the translation 
zero modes using the explicit expressions for the zero modes given by 
Equation (10.46). We use the coordinates x1, £2,r,0, but will rather use r = z+ R. 
First for the directions i = x1, £2, circular symmetry gives a factor of 27. The field 
strength and covariant derivatives of the scalar field are independent of the 0 
direction, i.e. Fọ „ =0,D9¢ = 0. The dominant contribution comes from the 
regions near z = R. We can use spherical symmetry in the three independent 
coordinates ©1,2%2,z. Then the normalization is given by, 


=N; 2r | derdezdrr (Eraras Eet (Dip)? 03) 
N nk | deıdezdr (£ FiF mi t (Dip)? 03") 
_?2 R 
5 fe DFG PG + ( (Dip)? (Dio)? (10.76) 


as, for example, F} + F54 = (2/3 (F2 + F31 + F52) = (1/3) X jr Fik 
For the mode i = 3,4 we get a similar expression, but there is angular 
dependence. Then, for example, D3 = cos0 D, and we get 


= N3 2r | dxıdezdrr (= Fis Fiz + (D3¢)* 9") 


=R fae fæ XO FLFS +(D.¢)° (Dz)* | cos?0 
i=1,2 
1 DAT. 


= N;— dO cos” g=5N (10.77) 
‘2m Jo 


ag> tg 
which can be related easily to the monopole mass. The monopole mass is given by 


Thus we only have to evaluate the integral f d?x (E; FS FS + (Dip)? (Did)*), 


M= fae (FRAP +5 (D9) D+). (10.78) 


However, the expression for mass, which is the energy of the monopole, must 
be stationary with respect to arbitrary variations for the fields. Making a scale 
transformation (x) > (ax) and A(x) > aA(ax) and demanding the mass be 
stationary at a= 1 gives 


[ae( (Grim) -500 Dye)" -3v(6)) =o. (10.79) 
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Thus 1 1 1 
f tavo) Sa per S5 (D?, (10.80) 
which gives 
1 se T argal é a 1 Ni 


So N; =27Re?M and N3 = N4 = n Re? M. Thus 


N; 1/2 N; 1/2 
€ 5) =(RM)/? i=1,2, ( t ) =(RM/2)/? i=3,4 (10.82) 
TE 


and 


4 1/2 2 
1 N; RM 1 M 
K ge l | a kK! = RA { —Rr2AM = —Rr2AM 10.83 

2 T (= ) A 2 4r R4 f 8r3 E ( ) 


Then putting in the factor for the classical instanton action we get the final 
expression for the amplitude of the decay of the magnetic field 


M2 
T= an e- RT2AM 6—("M?/g?B—g? /4) (10.84) 
Using M/R = gB, writing Mren. = M + AM and assuming AM «&« M 
2 p2 
= IZ e Men. / 9° BO"), (10.85) 


We have not taken into account the zero mode corresponding to internal 
rotations. As we have mentioned, this mode corresponds to the dyonic excitation. 
Without the creation of dyonic pairs, the zero mode will give a factor of 


J \i 
— 10. 
(=) ; (10.86) 


where J/R is defined to be the normalization of this zero mode. J is calculable 
from the exact solutions for the dyons as is the mass of the dyon [66]. There is 
a whole family of dyon solutions with all possible charges, all of which can be 
produced in pairs. We will not treat the calculation in detail here and refer the 
reader to the original article [4]. We simply quote the final result, writing Fm 
for the pure monopole result Equation (10.85) 


J 1/2 c oo 
T= (=) Serer DY e Oe gS 


=00 —oCo 


using the Poisson summation formula 


df (m) = 2 ( Í dre?™™™? f w) (10.88) 


and performing the ensuing Gaussian integral and that M/R = gB. 
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Quantum Chromodynamics (QCD) 


11.1 Definition of QCD 


Quantum Chromodynamics is the theory of strong interactions. It is a non- 
Abelian gauge theory based on the gauge group SU(3), which is called the 
colour gauge group. The gauge symmetry is preserved in this theory and, 
specifically, it is not spontaneously broken. The gauge bosons that carry the 
strong interaction are called gluons. The matter content of the theory consists 
of quarks, which are spin one-half fermions that transform according to the 
fundamental representation of SU(3), that is a three-component, complex triplet. 
The quark model was proposed in the 1960s and 1970s and elaborated in its 
incorporation into the “standard model” of particle physics corresponding to a 
gauge-theoretic description of the strong, weak and electromagnetic interactions. 
This model is now at the level of a confirmed theory. An untold number of 
experimental data have shown the existence of quarks and gluons, in addition to 
the matter content corresponding to the non-strongly interacting particles, the 
leptons, and the corresponding gauge bosons of the weak and electromagnetic 
interactions, which are known as the W and Z gauge bosons, and the photon. 
The strong interactions govern the interactions that give rise to nuclear forces. 
The matter that experiences these forces is generally called hadronic matter. The 
hadrons split into two categories: baryons, which correspond to the neutron, 
proton and atomic nuclei, which seem to be stable; and mesons, such as the 
pions, kaons and others, which all seem to be unstable. The fundamental building 
blocks of the hadrons are the quarks. The quarks interact directly with the gauge 
bosons of the colour SU(3) gauge group, which are the gluons. The quarks have 
colour charges and couple directly to the gluons, which themselves have colour 
charges. However, it is believed that the QCD vacuum is such that colour charges 
are confined, that free colour charges correspond to states of infinite energy. 
Therefore, the observable hadrons must all be colour singlet states. The baryons 
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correspond to the bound states of three quarks, and a colour singlet in the three- 
fold tensor product of the fundamental representation 3®3®3=10@68@8 61. 
The mesons correspond to bound states of quarks and anti-quarks, 3893 =8@1. 


There are many other possibilities for obtaining singlets, but these have not been 
experimentally observed. 


11.1.1 The Quark Model and Chiral Symmetry 


In the 1960s the quark model of hadrons was invented, with contributions from 
many different authors coming independently. It was understood that quarks 
come in many flavours, and these were named up, down, charm, strange, top, 
bottom, and more, if necessary. In our daily experience, we only encounter the up 
and down quarks. During the 1960s and 1970s, it was discovered how the quarks 
fit together to give rise to the observable hadrons, and also their interactions 
with the non-hadronic particles called generically leptons, the electron, muon, 
and taon, their neutrinos. The quark model seemed to indicate the existence 
of families of elementary particles, which bring together the strong, weak 
and electromagnetic interaction with gauge group SU,(3) x SU(2) x U(1), the 
gauge group of the standard model. Models of grand unification correspond to 
the inclusion of this group inside a single, semi-simple group, with symmetry 
breaking giving rise to the observed symmetry group of the standard model. The 
SU.(3) is the colour gauge group of QCD. The weak interactions are mediated by 
the SU(2), while the U(1) corresponds to what is called weak hypercharge. The 
weak SU(2) is spontaneously broken to a U(1) subgroup, the by now celebrated 
Higgs field and Higgs mechanism, and the actual electromagnetic U(1) gauge 
group corresponds to a linear combination of this unbroken remnant of the weak 
SU(2) and the U(1) hypercharge gauge symmetry. We will not elaborate the 
full standard model here, it is out of our interest and there are many very good 
references that describe the standard model in all its detail. For us it will suffice to 
know that the left-handed quark fields and the leptons feel the weak interaction, 
which only acts on left-handed fields, and transform according to the doublet 
representation of the weak interaction gauge symmetry. All right-handed fields, 
quark or lepton, do not feel the weak interaction, and only feel the strong and 
electromagnetic interaction. 

The first family comprises left-handed up and down quarks forming a doublet 
of the weak interactions based on the group SU(2) and transforming individually 
according to a U(1) charge called weak hypercharge, along with the left-handed 
electron and its neutrino, which also form a weak doublet with their respective 
weak hypercharges. The family is completed with the right-handed partners of 
the up and down quarks and the right-handed partner of the electron. The 
neutrino was not supposed to have a right-handed partner; however, this is no 
longer certain as it has been observed that the neutrinos must have mass. For the 


11.1 Definition of QCD 203 


purposes of this book, we will not add a right-handed neutrino. The right-handed 
partners of all the particles did not experience the weak interaction but did 
experience the weak hypercharge, and each member had a corresponding value 
for the weak hypercharge. The second family comprises the charm and strange 
quarks and the muon and its neutrino; and the third family comprises the top 
and bottom quarks and the taon and its neutrino. Chiral symmetry corresponds 
to the notion that there is a complete symmetry under unitary rotation of the 
quarks amongst themselves. In principle, this would correspond to a “flavour” 
symmetry group of SU;(6). 

Chiral symmetry is, explicitly, badly broken by the mass spectrum of 
the quarks. The best preserved subgroup is chiral SU(2) (which is also, 
coincidentally, the weak interaction symmetry) corresponding to iso-rotations 
of the up and down quarks amongst themselves as these quarks have masses 
in the range of a few MeV, which is almost negligible at the scale of the 
strong interactions. Including the next lightest quark, the strange quarks gives 
rise to chiral SU(3) symmetry, which is broken at a 10% level as the strange 
quark mass is around 100 MeV. This symmetry was named SU;(3), the three- 
dimensional unitary symmetry of flavour. Identification of this symmetry led to a 
great advance in the organization of the hadronic particle spectrum. This meant 
that the Lagrangian of the quarks was made up of three fermionic fields and it 
is invariant under the unitary rotation of the three fields into each other. The 
energy eigenstates then must form representations of this group of symmetry, 
much like the energy levels of the hydrogen atom form representations of the 
group of spatial rotations, SO(3). Even though the SU; (3) is broken at the 10% 
level, the physical hadrons, which are the energy eigenstates of the theory, are 
easily identifiable as being members of various representations of this symmetry 
group. The baryons form the representations 8 and 10 of SU;(3), while the 
mesons fall into the 8, as shown in Figures 11.1, 11.2 and 11.3, which were 
created by [84, 83, 82]. 


0 
s=1 K K+ 


q=-1 q=0 


Figure 11.1. QCD flavour diagram of the meson octet 
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SU;(3), being a symmetry of the theory, cannot be responsible for the strong 
force between the hadrons. The strong force must be independent of the flavour 
symmetry, for the flavour symmetry to manifest itself as a symmetry of the mass 
spectrum. The charm quark mass is about 1.2 GeV and the top and bottom 
masses are over 150 GeV, hence invoking chiral symmetry including these quarks 
is quite unrealistic. But what was holding the quarks together? 


11.1.2 Problems with Chiral Symmetry 


1. Chiral SU(3) symmetry implies the existence of multiplets of hadronic particle 
states, which have all been observed, and brings order to the chaos of the zoo of 
observed hadronic particles. However, there is a problem, as chiral symmetry 
predicts hadronic states such as the A** which is made of three up quarks or 
the A~ the corresponding states of three down quarks or the Q7 that of three 
strange quarks, each of them in a spin 3/2 state. The problem has to do with 
their wave functions. The three quarks should be in a spatially symmetric 
state as there is no additional angular momentum, a spin-symmetric state 


s=0 n p 
s=-1 > >t 
q=1 
s=-2 
= =0 
q=- q=0 


Figure 11.3. QCD flavour diagram of the baryon decouplet 
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giving rise to the spin 3/2 of the state and an iso-spin symmetric state as the 
iso-spin of each quark is identical. Such a state is not permitted for fermions 
by the Pauli exclusion principle, which requires that the wave function of 
identical fermions must be anti-symmetric under the exchange of any two. 
Therefore the quarks must have another, hidden quantum number, and the 
wave function of the state must be anti-symmetric under this hidden degree 
of freedom. 

. There exists a second experimental reason why the quarks should come in 
three colours. The ratio 


_ olete > qq) 2 
R= ee = LG (11.1) 


is simply predicted by perturbation theory, where Q; is the electrical charge of 


the quark. This ratio is measured experimentally and gives a rising function of 
the incoming energy, with a few isolated peaks corresponding to resonances at 
the positions of particles. However, it reaches a first plateau with a numerical 
value of 2 when it crosses the threshold for production of the strange quark. 
Now the sum over the charges of the lightest quarks, up, down and strange, 


1 1 . . 
3; —3, İs given by 


D HOE ( :) ṣ ( Naz (11.2) 


lightest quarks 


which are, respectively, 2, — 


Clearly if each quark came three times with three colours we get the required 
value 2. Increasing the energy of the scattering, once we pass the charm 
threshold at about 1.2 GeV, the value of R increases to a second plateau 
at 34. This corresponds exactly to the addition of the charge of the charm 
quark squared, Cie x 3. Finally after crossing the bottom quark threshold at 
an energy of about 4.2 GeV, the value of R again increases to a plateau at 
32 corresponding to the charge of the bottom quark, appropriately G)? x3. 
. Another experimental reason for three colours has to do with the decay rate 
of the neutral pion to two photons, 7° + 2y. This decay is mediated by the 
so-called anomaly diagram. The amplitude for the decay predicted if only one 
quark is circulating in the triangle is exactly three times too small from the 
observed amplitude. 

. “Anomaly cancellation” gives another reason to believe that there must 
be three colours. As mentioned, part of the flavour symmetry group 
is actually also gauged and gives rise to the weak and electromagnetic 
interaction. Sometimes gauge symmetries are broken by quantization of chiral 
fermions. A gauged symmetry must be respected at the quantum level; it 
is necessary to prove the renormalizability of the theory. Invariance under 
gauge transformations for the quantum theory is used in an essential manner 
to prove renormalizability. Therefore, it is imperative that the weak and 
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electromagnetic gauge symmetries are anomaly-free. The anomalies of the 
corresponding gauge group all potentially reside in the weak hypercharge U(1) 
symmetry. The weak hypercharge of the left-handed up and down quarks is 
3 while that of the right-handed up quark is 4 and that of the right-handed 
down quark is -2. The left-handed leptons, the electron and its neutrino, have 
weak hypercharge —1 while the right-handed electron has weak hypercharge 
—2, and we are assuming that the right-handed neutrino does not exist. 
The anomaly is proportional to the sum of the cubes of all the left-handed 
hypercharges minus the same for the right-handed charges. We must not forget 
that the quarks each come in three colours, giving an additional factor of three, 
and then this gives 


3x (2) sx (2) enren- (3x (5) +x (P) eat) 
(5) 2 (F)+(G)+8=0 (11.3) 


5. Finally, there has to be some mechanism by which the colour degree of 


freedom is not seen in hadronic states, and has to be confined. There is a 
good theoretical indication why a non-Abelian gauge theory could supply 
the correct interaction. First of all, the colour degree of freedom is flavour- 
blind, it is identical for each flavour. However, QCD being a renormalizable 
theory, we can perturbatively calculate the renormalization of the coupling 
constant. Non-trivial renormalization means that naive calculations of, say, 
the perturbative corrections to the coupling constant give infinite answers. 
However, by scaling the bare coupling constants of the theory appropriately, 
all the infinities can be absorbed into these inobservable, infinite, bare 
coupling constants, while the physically observed coupling constants are finite 
and defined at a chosen energy scale. However, then the value of the coupling 
constant at different energy scales is predicted by finite scaling, which is 
called the renormalization group. Perturbative calculations indicate that as 
the energy scale is increased the value of the coupling constant decreases 
(rendering, in fact, the perturbative calculations, which are valid for a small 
coupling constant, more and more precise). Evidently for lower and lower 
energies the coupling constant must increase. These properties are called 
asymptotic freedom at high energies and infrared slavery at low energies. 
Of course, the perturbative calculation becomes less and less reliable as the 
coupling constant increases, and hence actually only indications of infrared 
slavery are predictable via the perturbation theory. Nevertheless, the picture 
for quarks emerges, that when they are close together, at short distances which 
correspond to high energies, they are essentially free and non-interacting. 
However, as they try to separate from one another, at long distances, the force 
between them increases and, in principle, it would require infinite energy to 
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separate them infinitely far. The theoretical prediction of asymptotic freedom 
has been observed experimentally. When very high electrons impinge on 
a hadronic target and suffer deep inelastic scattering, they scatter off the 
individual quarks, which, because of the high energy of the electrons, are 
being probed at very short distances. The quarks then should behave as 
free particles. This is exactly what is observed. The deep inelastic scattering 
cross-section for electrons on hadrons exhibits the property of scaling, that 
the cross-section is simply that of an electron scattering from a free quark 
of momentum x x py, where py is the total momentum of the hadron and 
x is the fraction of that momentum carried by the quark, multiplied by a 
factor that corresponds to the probability of finding a quark with momentum 
fraction z. 


Thus the colour degree of freedom arose, and making it a local gauge degree of 
freedom gave the added bonus that it provided a means for obtaining interactions 
between the quarks that would in principle bind them together. 


11.1.3 The Lagrangian of QCD 


The Lagrangian density of N flavours of free quarks is given by 
N 
L£=> 08 yO, — me) ws. (11.4) 
a=1 


The label a corresponds to the different flavours, while the label a corresponds to 
the colour and the summation over repeated colour, flavour and Lorentz indices 
is assumed.’ Interaction terms involving just the fields Ya themselves, such as 
(hae)? or (payye) CARA and any others, are not renormalizable. To 
have interactions between the quarks, we must add other fields such as gauge 
fields or scalar fields with which the quarks interact, and then with each other 
through the exchange of the additional particles. We will consider the idea of 
gauging the added SU (3) colour symmetry, the symmetry in any case seems to 
be required for the existence of fermionic statistics of the quarks in some of the 
hadronic states. 

The colour degree of freedom corresponds to the index a, which goes from 
1 to 3, and we will now add gauge fields corresponding to making the gauge 
symmetry SU(3) local, 


N 
£= D105 Ci" (Op + An) = M°) ba: (11.5) 


1 The colour metric or the flavour metric are both simply the identity matrix so we will write 
the indices above or below depending on convenience. The Lorentz indices are summed 
with the Minkowski metric, thus for these we will rigorously only sum a raised index with a 
repeated lowered index. 
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The covariant derivative D, = 0, +A, now appears with A, = iA’ Ai, i=1,---,8, 
and the A; correspond to the 3 x 3 Gell-Mann matrices 


0 1 0 0 —i 0 1 0 0 

à=ļ| 1 0 0 4,A=] ¿i 0 0 |,As=] 0 -1 0J, 
000 0 0 0 0 0 
0 0 1 0 0 -i 00 0 

As= | 0 0 ,As=] 0 0 0 |,As=]}] 0 0 1 
1 0 0 i 0 0 010 
00 0 1 1 0 0 

sal Aes Sh ai oes, “ioe a (11.6) 
0% 0 v3 0 0 -2 

The Gell-Mann matrices satisfy the Lie algebra of SU(3), 


where f'k are the structure constants of SU(3). The structure constants are 
completely anti-symmetric, fitik = — fJ** = — fiki with f1? = 1, f14?7 = — f156 
fP = fF = PO = — S367 = 1/2, F458 = fo = 4/3 /2. To this action, we add the 
Lagrangian for the gauge fields 


Leauge = = 1 pi p", (11.8) 


where, as previously defined, Fi; is obtained from 
[Dus Dy] = iFj Ass (11.9) 


explicitly 
Fi, = ð Ab — 0, Al, — fA AS. (11.10) 


Our aim in this book is to consider the importance of the classical solutions 
to the Euclidean equations of motion, the instantons. Thus we will write the 
Euclidean Lagrangian density as 


aoe 1 
Lp = Fl = =T (Phd pv) (11.11) 


where now the Lorentz index becomes a Euclidean vectorial index and the metric 
in Euclidean space is just the identity, hence we change the sign in the first 
equality, and 


Fav = ð pAr — 3y Ap + [An A], (11.12) 


which is an anti-hermitean matrix-valued field. 
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11.2 Topology of the Gauge Fields 


We shall look for configurations of finite Euclidean action 
Sg = | zkr. (11.13) 


We assume that for large radius r in four-dimensional Euclidean space, the gauge 
fields can be expanded in powers of 1/r. For finite action then, F y must decrease 
as o(1/r?), where o(1/r?) means faster than 1/r?. This implies that the gauge 
field must decrease at least as o(1/r) up to pure gauge terms 


A,=0(=) +g(9Q)3 g (Q), (11.14) 


where g(Q) is a function only of the angular variables Q at infinity. Then 
g(Q)ð g7 (Q) ~ 1/r, and this yields the required behaviour for Fy 

But g(Q) is defined essentially at infinity of Euclidean spacetime, R4, which 
is topologically the three-sphere S$. Thus g(Q) defines a mapping of the three- 
sphere at infinity into the gauge group SU(3), 


g(Q) : S? 4 SU (3). (11.15) 


These fall into the homotopy classes of mappings which define the homotopy 
group Ia(SU(3)). Gauge group configurations gı(Q) and g2(Q) can be 
continuously deformed one into the other only if they fall into the same homotopy 
class. We write gi(Q) ~ g2(Q) if they are in the same homotopy class. The 
homotopy group is well known, 


I(SU(3)) =Z, (11.16) 


where Z corresponds to the integers, and an integer corresponding to a 
homotopy class is called the winding number. This means that each configuration 
can be associated with a class of homotopically equivalent configurations, 
which have the same winding number. Configurations with different winding 
numbers cannot be continuously deformed one into another, since the winding 
number can only change discretely. Continuous changes cannot change the 
winding number. Consequently, different gauge field configurations of finite 
Euclidean action must also fall into topologically distinct homotopy classes. 
A gauge field configuration A(x) with a limiting value defined by the 
asymptotic gauge group configuration gı(Q) cannot be continuously deformed 
into another gauge field configuration Aə(x) with a limiting value defined by 
the asymptotic gauge group configuration go(Q) unless gi(Q) ~ go(Q). If the 
asymptotic gauge group configurations are in different homotopy classes, the 
existence of a deformation of the gauge fields into each other continuously keeping 
the Euclidean action finite would be a contradiction, as it would provide a 
deformation of one asymptotic gauge group configuration into the other. 
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We might imagine that as the theory is invariant under local gauge 
transformations, we might be able to remove the asymptotic gauge dependence. 
Suppose we make a gauge transformation at infinity, 


g —> hg (11.17) 
for some group element h. Then the gauge field transforms as 


A, > h(Ap rook 
= h (gð g7 +0(1/r)+3,) ht 
= hg(ð g ')h t +hd,h~* +0(1/r) 
= hg(ð (hg) *) +0(1/r). (11.18) 


Thus if we chose h = g7! 


, we could eliminate g. But this is impossible because the 
gauge transformation h should be a differentiable function defined over the whole 
space R’. At least h should be a continuous function over all of R4. Thus if we 
define h = g~! at infinity, we must be capable of continuing the definition of h 
throughout space, including the origin. This is clearly impossible since the origin 
is a degenerate sphere on which the mapping must be trivial. This implies that 
the gauge transformation h must be in a class of gauge transformations that 
can be continuously deformed to the trivial mapping. Hence h cannot satisfy 
h= g7! at infinity, as g is not in the class of trivial mappings. Thus any gauge 
transformation h can modify g at infinity, but only within its homotopy class, 
g—hg~gq. The integer invariant corresponding to the homotopy class of g is 
seen to be exactly the Chern number of the gauge field configuration. 

We can explicitly construct the gauge transformations that give rise to the 
different classes of gauge fields 


g@)=1 


(1) i= wa +iz 
(wa +18)? 


(a) = (9)! 


(11.19) 


defined over each S that contains the origin. The gauge transformations are 
singular at the origin (except g). 
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11.2.1 Topological Winding Number 


We can explicitly calculate the winding number of the gauge field configuration 
through the following analysis. Consider the integral 


—1 G 
v= sey | abe Tr (loðs 1) (gð;g 1) (gOkg DE (11.20) 


where the integral is over a three-sphere with local coordinates 0;. For any local, 
infinitesimal transformation of g, g > g(1 + ôT) and g7! — (1 — ôT)g7! so that 
gg = 1 is unchanged. This means that with ôg = gôT and g`! = —ôTg7! we 
will show that v is unchanged. We then find 
5(gOng~') = gôT Ong” ' — gO (ST g7) 
= gôT 3kg! — g(O,6T)g~' — gôT 3kg! 
= —9(0,6T)g™'. (11.21) 


Thus the change in v is 


ov 


= 34,2 [exci rr (g0:6T) 97 (g0;9~')(gOxg7')) x3 


1 7 

Ee / Be" Tr (08T) (3g) gg *)g) 
—1 E B 

= aa | Poci*rr (0;6T)(O;9 ")(Ox9)) 
T 


where in the first line the factor of 3 comes because the contribution from each 
of the three factors is the same, in the third line we use g(Oxg~!)g = —Opg and 
the last line vanishes as the integral is of a total derivate over a three-sphere that 
has no boundary. 

We can evaluate v explicitly for g@. At the “north pole”, z4 = 1,2‘ ~ 0 then 
we can take 6* = qt 


9, ( om)" et +i- a, rtiz g 
g as north pole 7 (x4 a Ea a i (x4 + ele) 
: (xt —i#-@)x* 
= 104 = 
(FE? 


However, the symmetry of the configuration means that the integrand is the 


| 


x4=1,xri=0 


= —ic;.(11.23) 


at=1,27=0 


same at all points on the sphere. Hence, 


EIET r (gig) (gð) (gkg) = te" Tr (aio ;0K) 
= ie'l* Tr (i€;j1010%) 


= e 6; 51251, = —2-6 = —12 (11.24) 
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using 040; = icik op + dij. Thus 


v= atl 2» |do a z | ao=1, (11.25) 


since the volume of the unit three-sphere is exactly 27”. This is obtainable by 


integrating over the angular variables in R* in the generalization of spherical 
coordinates. It is easy to see the v(gig2) = v(g1) + v(g2), indeed, using form 
notation 


—1 3 
v(gige) = ae see | Tr ged(arge)- 5) 


= ps | T (ogldoz' e Fad) 
Sj 
Dn? 
x Jar (g192(dg3*)gT g1 (doy ')(g192(dg3 ')gy' +9149; ')) 
—1 
377 
x J Tr (g2(dgz ')(dg7")g192(dg7 ") + g2(dg7 ")(do7 Yg: (do7 g1) 
=v(g1) + vlg) + E5 | d (Tr (g2(doz dorg) 
=v(g1)+v(g92) 


=v(g1)+v(92)+3 


=v(g1)+v(92) + 


(11.26) 


where we have used d(gd(g~!)) =—gd(g~+)gd(g7'). 
We can define 


2 
Gy = 4€yvr0 Tr (AoA. + FA, AxAn (11.27) 
then 


0 Gp = 4Euvao Ir (a, A,OAgt= O AAAs + Ap On AAs + Az AO A d) 


= 4euvao Tr (Op, ALO, Ag +2 Ada 
= 4euvyo Tr ((0 Ay + ApAr)(O,Ac + AAo)) 
= emmo Tr ((3 Av — Op Ap + [Ap, Av])(O, Ao — Oo Ay + [Ay, Ao])) 


= Eyvro LT (FuvP yo) . (11.28) 
But 


1 
/ d'20,Gy = dS,,G, = dSp4euvao Tr (Ao = Av AsAr 
TCO 


(11.29) 


Toco 
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and since 
Fie =0( 5] + A, = gð +0(2), (11.30) 
then the first term in Equation (11.29) falls off too fast to contribute while the 
second term gives exactly the expression for v in Equation (11.20). Hence 
ae 1 
327? 


1 ` 
2 r | toT (ioti) (11.31) 


[acre (€pvro Puro) 


where the dual field strength is defined as F jw = hew no Eo: 
We can summarize our findings as follows. 


1. Each gauge field configuration of finite Euclidean action is associated with an 
integer, called its Pontryagin number. 

2. It is impossible to continuously deform one gauge field configuration into 
another with different Pontryagin numbers, keeping the Euclidean action 
finite. 


For any other gauge group, SU(3) in particular, there is a theorem by Bott 
[18] that says that any mapping of S° into a semi-simple Lie group G can 
be continuously deformed to a mapping into a SU(2) subgroup of G. Hence 
everything that we have shown for SU(2) is actually valid for any semi-simple 
Lie group G. The only thing that changes is the normalization in the formulae for 
the winding number. However, if we use the notion of the Cartan scalar product 
on the Lie algebra of G, defining 


(T°T”) = ô” = aTr rr) (11.32) 


then œ depends on the representation of the T°, but the formula for the 
Pontryagin number is universal 
1 4 ~ 
y= 3972 d (FF). (11.33) 


Now with the possibility of many inequivalent classical sectors in the space 
of field configurations, we expect the existence of the many different vacuum 
configurations, and of course the possibility of quantum tunnelling between 
them. 


11.3 The Yang—Mills Functional Integral 


We begin with the functional integral 


a 1 D j 
T=N f DAc!* aga Fur Ew) (11.34) 
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We must fix the gauge, we will choose A3 = 0. We then have the following 
observations: 


1. It is easy to see that all gauge field configurations may be put into this gauge, 
simply take 


z3 
h=P (cw / te! Aa(a! 22,2 24)) : (11.35) 
Then 
hdsh—1 =—hA3h7 (11.36) 
hence 
A’, = h(A3 + 03)h7* =hAgh~+—hAgh—* =0. (11.37) 


2. The Faddeev—Popov factor is just a constant. 


11.3.1 Finite Action Gauge Fields in a Box 


We will consider the theory in a finite spatial volume V, but always have in mind 
that V — œ at the end. The same for the Euclidean time T. We must choose 
boundary conditions on the walls. We will choose the boundary conditions such 
that the bulk equations of motion are not modified because of them. The general 
variation of the action is 


aL aL 
= 4 | 
ös= fa sgg, + 55 4 OA 


ðL ƏL aL 
2 4 ASS a 
o ny het ea hs 
ƏL ƏL 
3 4 dace 
= fès nt feel aa a) As 
aL Ak 
3 4 
afa sii” FypõAp es (- F e) A (11.38) 


Therefore, to not have any contribution from the boundary we must impose 


A” F,,6A, =0 (11.39) 


on the boundary. We can decompose 5A, into its normal and tangential 
components, dA, = (A? )û, + JAI", where uA = 0. Then the 
boundary condition Equation (11.39) becomes 


AY Fy (0AP Jai, SAEI) = AY Fy 0 AtS = 0, (11.40) 


since Fy, = —F,,,. Thus we are required to fix the tangential components of the 
gauge field on the boundary and, consequently, we impose that the tangential 
components may not be varied on the boundary, so that oA re = 0. We must 
also respect the gauge-fixing condition, A3 = 0, and we are only interested in field 
configurations whose action remains finite as the box size is taken to infinity. We 
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Figure 11.4. Paths over the boundary defining the gauge group element 


will see that these conditions mean that the winding number inside the box 
must be a definite integer. We will show that the only vestige of the boundary 
conditions is that the winding number inside the box is a definite integer. 

Indeed, inside a large box of dimensions (Lj, L2, L3,T), gauge fields that 
remain of finite action when the box is taken to infinite size must have the 
behaviour 


A, = 49a! +0(*) (11.41) 


r 


on the boundary. g0,,g~' is obtained from the limiting values of the gauge field 
configuration, and hence must be continuously defined over the entire boundary. 
g is extracted by performing the path-ordered exponential integral, as shown 
in Figure 11.4, along a nest of paths that start at an initial point xj on the 
boundary at x? = —L3/2 and move along and cover the boundary to all other 
points on the boundary x” 


g(a") = P exp (- i. asau” ) ; (11.42) 


The integrability condition that the gauge group element obtained from the 
path-ordered exponential integral from two different paths is the same, and is 
exactly that the field strength vanishes on a surface whose boundary comprises 
the two paths. This condition can be easily verified for an infinitesimal loop. 
The field strength does indeed vanish for A,, = g0,,g~'. Thus g is continuously 
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defined over the entire boundary. g is the unique solution of the linear, first- 
order differential equation 0,,g~' = g~'A, (or equivalently 0,,g =—A,g), up to 
an irrelevant, multiplicative, constant gauge group element. Equivalently, the 
actual gauge group element defined by Equation (11.41) is also ambiguously 
defined up to a constant gauge group element go; we can simply take g —> ggo 


as then the gauge field A, = gô, g~t 


is invariant. The constant gauge group 
element is irrelevant, it does not contribute to the action or any winding number. 


The integration paths are perpendicular to the x? direction on the two faces at 


x? = +L3/2, hence g is necessarily independent of x°. Along the other surfaces, 
we integrate along lines parallel to the x? direction, but since A3 = 0 the gauge 


group element is unchanged. On the two surfaces at x3 = +L3/2, the gauge 
transformation is not necessarily the same. 

Specifying g on the boundary fixes only the tangential components of A,, since 
g only varies along the boundary that corresponds to the directions tangential to 
the boundary. The normal component of A, must also be given by the form given 
in Equation (11.41). However, these then depend on how g varies as we move away 
from the boundary into the bulk. The normal components of A, do not need to 
be specified, since all we insist on is that the boundary values do not contribute 
to the equations of motion. Thus we do not have to specify the variation of g as 
we move away from the boundary into the bulk. One thing is important, since 
the surface of the box is topologically S3, the gauge group element g defined on 
the boundary can perfectly well be in a non-trivial homotopy class of II3(G), 
and hence may not necessarily be continuously defined throughout the entire 
box. Indeed, g is only defined by the asymptotic behaviour of the gauge field on 


and near the boundary. 
34 


On the surfaces at x t 13/2, the gauge group element depends, in principle, 
non-trivially on the three coordinates (x!,x?,24) and g(a!,x?,—L3/2,x+) Æ 
g(a',x?,+L3/2,24) as in Figure 11.5. But on the surfaces that connect the 


boundaries of these two ends, since A3 = g03g7! 


=0 from the gauge condition, we 
must have that g is independent of x°. Thus the values of g on the boundaries 
3 —+1 3/2, i.e. for at least one of: x! = +L,/2, 
x? = +L3/2 or xt =+T/2, and z? =+L3/2, are the same. Now we will perform a 


gauge transformation by h(x!,x?,x?,«*), which is actually independent of x? and 


of the two end surfaces at x 


defined by the value of the gauge group element at the surface 23 = —L3/2, i.e. 
h(a’ ,a? 2°, x4) = gt (a 27, —L3/2,2°). (11.43) 
Then 


Ay => h(Ap +9,)h7" = g '(a',2”, -—L3/2,2*)(Ay +ô g(x, £’, -—L3/2,2*). 
(11.44) 
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9(X1,L9, +L 3/2,a,) 


9(X1,L2,04) 


G(X1,L2, -L3/2,a,) 


Figure 11.5. Boundary with the gauge group element 


But since A, = g0,g~!+0(+) for large r, we get 


Tr 


1 
Ay = Gala (oouo +o G) +04.) (dai) . (11.45) 


We emphasize that the g appearing inside the middle bracket depends on z? while 
that on the outside is independent of x3 and is equal to its value at x? = — L3 /2. 
Evidently we can then write 


1 
Ay=ndyar'+0(4), (11.46) 


where 
g(a") = Cae g(a”). (11.47) 


Evidently, gı is equal to the identity on the surface at x? = —L3/2 and also on 
the surfaces where x? changes, —L3/2— L3/2. On the surface at x? = L3/2, gı 
must then be the identity on the boundary of this surface (for at = +L /2 and 


so on), but for interior values of x',a? and a* generally g|,3-74/2(«',#7,«*), 


need not be equal to g|,3—_74/2(«',#?,a*). Indeed, if the instanton number of 


the gauge field configuration is non-trivial, then g;|,.3— L3/2 # I since the gauge 
transformation h of Equation (11.43) cannot change the instanton number. The 
instanton number given by the integral Equation (11.31) is gauge-invariant. The 
surface at x? = L3/2 with its boundary identified is also topologically $3, and gı 
defined on this surface goes to the identity at its boundary. This means that gı 
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is also defined on this surface with its boundary identified as topologically $°. 
Thus gı defines a map from S? —> G, which contains all the topological winding 
number information of the gauge field defined in the entire box. We want to see 
what happens under a change of the boundary conditions. We will implement 
this by fractionally changing the size of the box. 

We imagine placing the original box in a larger box that is extended along the 
x? direction by A with the gauge field configuration also extended into the larger 
box. In the larger box, after the corresponding gauge transformation, there will 
also be a gauge group element, g2, defined on the surface at 2° = L3/2+A, which 
is identity on its boundary (and identity on all the other surfaces of the box), 
just like gı on its respective boundary. We will extend the box in such a way 
that the fractional change in the volume is negligible. If we choose A = (L3)!/?, 
the fractional change in the volume is negligible, 


5V LlL4A A 


= = — =(L3)7/? 40. 11.48 
V iqisiel, Ls (Zs) ( ) 


Alternatively, the volume of the larger box is 


1 1 
A LP 


If gi and g2 are in the same homotopy class, we will show that all gauge 
field configurations defined in the smaller box can be extended to gauge field 
configurations in the larger box, with negligible change in the action. If gı and 
gz are not in the same homotopy class, this is not the case: there has to be at 
least one more instanton outside the smaller box which implies an increase the 
action by at least 8r?/g°, which is the minimum action of one instanton, as 
we will see in the next section?. This increase in the action is independent of 
the amount of the extension of volume of the box, even if the volume is only 
extended fractionally, negligibly. Thus for gı and g2 in the same homotopy class, 
the action changing negligibly means that extending the box is simply equivalent 
to a changing boundary condition gi > g2. The action is invariant, but the only 
vestige of the boundary condition is the topological winding number encoded in 
gı or any other homotopically equivalent boundary gauge group element and the 
corresponding boundary condition. 
Let g(x!,a?,s,2*) with s € [0,1] be a homotopy from gı to go: 


g(s=0)=g1, g(s =1) = ge. (11.50) 


Then for 
h=h(a",x*, £, x) = g(x',2”, (x? — L3/2)/A,2*) (11.51) 


2 See Equation (11.71). 
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for x? € [+L3/2,L3/2+ A], and with the gauge field extended as A, = hd,h~, 
evidently the action does not change. However, the gauge condition A3 = 0 is 
not respected. The choice 


=i 
A, = ta nes (11.52) 


0 b=3 
does satisfy the gauge condition, but the action is slightly changed. The increase 


in the action is easily calculated, using gauge invariance. We transform the gauge 
field of Equation (11.52) by h71, which gives 


0 3 
A, = Ure (11.53) 
hth L=3 


Then, integrating only over the extension, 
a l a l 
Se=- d sgg Fue Eun) = — d ryg Ets) (11.54) 
Then A; = g7!0,g03 (22) = (g7'0.9) 4 ~ 4, and consequently 


1 
Fys = OAs — 054, = 0,43 ~ A (11.55) 


as the commutator term vanishes. Thus (Fuy Fv) ~ on But the integral 


L3/2+A 
| dr~ A, (11.56) 
L3/2 


which implies that the action also changes by a negligible amount 
1 
89e ~ q = (Ls)? 40. (11.57) 


Hence a change in the boundary conditions that preserves the winding number 
is just a surface effect, not a volume effect. The action is invariant. Therefore, 
only the winding number remains, which is defined by the gauge group element 
g(a',a?, L3/2,2*) which defines a map S? > G. 

Now suppose we decided to choose a different boundary condition, not the one 
that fixes the tangential components of the gauge field on the boundary but some 
arbitrary, other boundary condition. We will still work with the gauge condition 
A3 =0. The condition that the action be finite still imposes that 


1 
A,,= 98,97! +0( =) (11.58) 


and nothing obstructs from performing the x? independent gauge transformation 


2 


that gives g(x!,x?,x+) on the end surface at x? = L3/2, in the same way 


as before. We will compare gauge field configurations in this gauge. But 
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now g will not correspond to any such gauge group element obtained when 
the boundary conditions fixed the tangential components of the gauge field, 
although the homotopy class of g is fixed by the winding number. Thus gauge 
field configurations giving rise to g would not be included in the subset of 
configurations that satisfy the boundary conditions on the tangential components 
of the gauge field that we have considered. However, the arguments given above 
show that, although we do not get the exact gauge field configurations with 
different boundary conditions, we do get configurations that are arbitrarily close, 
by small deformations near the end at x? = L3/2. Indeed, any given g can be 
obtained by making changes to the gauge field configuration only in the extended 
part of the box, as we did when defining the homotopy from gı to g2, now we will 
simply consider the homotopy from gi to g. Thus all gauge field configurations 
apart from a small difference in the extended portion of the box are permitted by 
our boundary conditions, and this difference gives negligible change for a large 
enough box. 


11.3.2 The Theta Vacua 


Therefore, for a large enough box, we can simply forget the boundary conditions, 
but impose that all configurations in the functional integration correspond to 
those of a fixed winding number n. 


F(V,T,n) SN f DAEs, (11.59) 


where DA = D(A, Ao, A4). F(V,T,n) is a matrix element between an initial state 
and a final state that are determined by the boundary conditions. For T} and To 
taken very large, 


F(V,T, +T,n)= 5 F(V,T,,m1)F(V,T2,n2). (11.60) 
nytng=n 
This is because the winding number 


1 5 
pea J tiia (11.61) 


is an integral of a local density (Fav Euv). This means that one way to put a 
configuration of winding number n into the box with Euclidean time length 
given by Tı + 7) is to put v =n; into the first part of the box and v = nə into the 
second part. Such configurations neglect configurations with significant action 
on the border between the two parts; however, we expect that this contribution 
is negligible for large T; and T>. Normally a matrix element for T = Ti + Tə that 
gets a contribution from only one energy state follows a multiplicative law. The 
convolutive law of combination of the matrix elements above, Equation (11.61), 
can be simply disentangled into the more familiar multiplicative law by a simple 
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Fourier transformation. Defining 
F(V,T,0) =) e°F(V,T,n) SN f DacrSeei" (11.62) 
implies 
F(V,T, +T2,0) = F(V,T,0)F(V, 72,9). (11.63) 
This implies the existence of states such that 
F(V,T,0) = N'(0|e~ #7 |6), (11.64) 
where the states |0} are eigenstates of the Hamiltonian. Our field theory is now 
surprisingly separated into a family of sectors enumerated by 0. In each sector we 
use the same action except we add an extra term proportional to v = 0(F; A ae 
We can obtain all of these results from the functional integral and from the 
possible instanton solutions to the Euclidean equations of motion. If there is 


a solution for v = 1, all the results follow. Suppose such a solution exists with 
action Sg. Then translation invariance gives at least four zero modes, and 


(dle—#7|6) = xf DAeTSE et? 
pas Keo) n+n (VT) e i(n— a) /nin! 


= 2KV TES cos 0 (11.65) 


where K is the usual determinantal factor and a sum has been done over n 
instantons and ñ anti-instantons. Then we can read off the energy of the |8) 
states, 

E(0) = —2V K cos6e~*°, (11.66) 


We can also compute the expectation value 
E 1 2 
(O\ Fv Fn)18) = | PRONE RaO) 


_ 32? [DA ve SE e? 
VT | DAe~Sue? 


—327r7i d Sr iv 
T E WwW 
= P7 ln (/ D Ae e ) 


—32n7i 
= Vr aS 2K cosde— So) VT 
= —64n7iKe~ sind. (11.67) 


The answer is imaginary, but this is correct. Since (F, tak, uv) = (Fi2Fs4 + perm.) 
in Euclidean space, analytic continuation to Minkowski space yields, for example, 
Fj4 — iFjo. Hence the imaginary result in Euclidean space corresponds to the 
correct, real result in Minkowski space. Everything depends on 9, it is a physical 
parameter. 
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11.3.3 The Yang-Mills Instantons 


The instantons do actually exist as solutions of the Euclidean equations of 
motion. We can prove that the action in the single instanton sector is bounded 
from below, and when the bound is saturated, the configuration must satisfy the 
equations of motion. Consider the inequality which is evidently satisfied 


— I dtx( (Fav + Fuv)(Fuv + Fuv)) > 0, (11.68) 


note that the minus sign is there because our gauge fields and field strengths are 
anti-hermitean. This implies 


- f ate (Eua aE (Fuu Fav) 2 HPP) >0. (11.69) 


But the first two terms are equal, hence choosing the + sign appropriately, we 
have 


TEEME | E (11.70) 


But the right-hand side is just the instanton number while the left-hand side is 


proportional to the action, thus we find 
8r? 


as we had promised to show earlier. The equality is attained for 


ig = ay, (11.72) 


where we get the + sign for v > 0 and the minus sign for v < 0. If we can find the 
solutions of Equation (11.72), we automatically get solutions of the full equations 
of motion, as the action is minimal for such configurations and hence, must be 
stationary. A bonus is that Equation (11.72) as a differential equation is now 
a first-order equation, instead of a second-order equation, and is consequently 
easier to solve. 

For v = 1 we will look for a solution that asymptotically behaves as 


=] 1 
Ay =9 Oy (9) +o(+), (11.73) 


where g‘) is the gauge group element defined in Equation (11.19). g@ is 
spherically symmetric, hence we make the ansatz 


-1 mii 
A, = f(r)r?g®3, (®) = -iAig’. (11.74) 


Using a double index notation, for the gauge group SU(2) seen as the diagonal 
subgroup of SO(4), we can write 


it a eee 
A= 5 (a + T (11.75) 
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for anti-symmetric SO(4)-valued gauge fields AGP = — Abe, An easy calculation 
with g given in Equation (11.19) gives 


ASL = F(r)(£aðug = TBa). (11.76) 


Then a straightforward calculation gives 


£ 
Fil! =f -P PYCpabve—Sya6i0)+ (2) 
X (LolpOvg — Cal Ong + LBL Ope — CBLyOva)- (11.77) 


The condition of self duality, Fy, = Pav, is automatically satisfied for the first 
term (d,0618 — Ôb óva) = Envor Eorab 

1 1 

zon (Jadu a 58 Ove) = g Aon Eenvor for ap = €\patEotaB (11.78) 


but not so for the second term 


1 
gpn (Tak pdup — Lol Ong t+ Lely Opa — CBE pOva) = Expy Blak p — Ezppa t bE p- 
(11.79) 
Thus we obtain a self dual field strength by imposing 
‘io 
L 4 f? =0,. (11.80) 
This integrates trivially as 
1 
MJS ap (11.81) 
where A is an arbitrary integration constant. Thus 
r? -1 
Ay = 59D (oe). (11.82) 


We will find that there exist eight parameters corresponding to symmetries of 
the action that are broken by the solution. These correspond in principle to scale 
transformations, rotations, translations, special conformal transformations and 
global gauge transformations. Scale transformations correspond to changing A. 
Note that the global gauge transformations preserve the gauge-fixing conditions 
A3 = 0. Rotations and global gauge transformations are tied together, the 
solution is invariant under the diagonal subgroup of simultaneous rotations 
and global gauge transformations by the same amount. Note that the rotation 
group SO(3) and the global gauge group SU(2) are essentially the same 
group. Special conformal transformations can be obtained by translations and 
gauge transformations and hence do not give rise to new solutions. This in 
the end leaves eight parameters, coming from one scale transformation, four 
translations and three rotations (or equivalently global gauge transformations). 
For a configuration on n instantons and 7 anti-instantons, the number of 
parameters is simply 8(n +ñ). 
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11.4 Theta Vacua in QCD 


The existence of the instanton solutions means that there exist different, 
inequivalent classical ground states, between which the instantons mediate 
quantum tunnelling. We have not explicitly seen these vacuum configurations; 
to uncover them, it is more convenient to use the temporal gauge, i.e. Ap = 0. 
As, in principle, everything we do does not depend on the gauge choice, we are 
free to take any gauge that we want. The dynamical variables in this gauge are 
just the spatial components of the gauge field Aj. 
In Minkowski space, the Hamiltonian is given by 


H= fde (EP? +(B)”), (11.83) 


where the electric and magnetic fields are given by 


EY = A? 
1 
B? = seijn (j A} — OnAG + FP°AAG). (11.84) 


In this gauge, since there is no field Ap, the equation of motion that usually 
comes from varying with respect to it is missing. This is the Gauss law 


G° = 0,A% + fÜ A Ao = (DiE? = 0. (11.85) 


However, in this gauge the Hamiltonian is invariant under time-independent, 
spatial gauge transformations. The corresponding conserved charge is actually a 
local expression, exactly the Gauss law operator, G° = 0, i.e. 


[H,G*] =0. (11.86) 


Thus we must impose this constraint on the initial values of the fields, then 
since the time evolution preserves the constraint, the Gauss law operator will be 
preserved for all time. 

Now in the quantum theory, eigenstates of the field operator A? (x) correspond 
to the states |A%(x)) and the amplitude for a transition between two such states 
is given by the functional integral 

(Āta T |ag(a)) = f CO D(A exe E tba HB), 
Qa 
- (11.87) 
where the functional integral is over all time histories A$ (g(t) that interpolate 
between the initial and final configurations. In the quantum theory, however, the 
Gauss law constraint is imposed as a constraint on the Hilbert space, a physical 
state in the Hilbert space of all states is one that is annihilated by the Gauss 
law operator 
G" (2) |b) = (DiEy)|W) =0, (11.88) 
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the states |A%(x)) do not satisfy this constraint. We wish to characterize the 
states that do satisfy the Gauss constraint. 
Under a gauge transformation 


A; SU AU! +i dy! (11.89) 
with Ua = e"T" where A(T) is independent of t. Defining the corresponding 
conserved charge 

Oe / dx At (8,4° + FA APY?) 
=~ f ar @D.B," (11.90) 


integrating by parts and assuming A* — 0 for |z| > oo. Then the gauge 
transformation can be effected by Q) as 


AjZ,t) > Ald, t) =e AZ, te’ 
= A;#,t) —i[Q), AiZ,t)] for infinitesimal A 
= A;Z,t) -i fèy 
x [ÀR (T0) (BAG) + FAIT, DA T, t)) AlE) 
= Ag, t) — (D;A)°z,t)T? (11.91) 
using the equal time canonical commutator LÄS (p, t)), A® (Z,t))] = 883 (# — y) 
and that the time variable in the integral expression for Qa can be chosen 
arbitrarily since it is in fact time-independent and here conveniently chosen 
equal to the time variable ¢ in A;(Z,t). Thus Q) generates the infinitesimal gauge 


transformation corresponding to A, and physical states should be invariant under 
the action of this gauge transformation, i.e. 


eLa jy) = |Y) (11.92) 

for À falling off sufficiently fast as |Z| > 00. But 
ea A$ (x)) = [Ua (Ai +i0,)Uy*), (11.93) 
ir ( 


where Uy =e" T? T herefore, a physical state will be obtained if we sum over 


all states that are related by a gauge transformation 
|Ai(Z)) prysical = I DA (E) SN | AF (x) (11.94) 


integrating over a dummy field variable A’. This is obvious since the integration 


measure is invariant under translation, hence 


etA |A; (E))physicai = eA / DN (a)e!2x'|A2(«)) = / DN (Bei |AS (2) 


= [ DX (We! 1AF (2) = Ail) physical (11.95) 
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Now what are the possible classical ground-state configurations? Such 
configurations must have H = 0. This requires E} =0 and BY =0. A vanishing 


1 


magnetic field means that the gauge field is pure gauge A; = gOjg-* and a 


vanishing electric field means A; = 0, which implies g = 0. Thus for a classical 
vacuum the gauge field must be A; = g(Z%)0;(g(Z))~!, and then we must 
implement gauge invariance as in Equation (11.94). Writing a corresponding 


state as |g(Z)), for a gauge transformation 


Ig(Z)) > e'|g(@)) = Uag(@)) = |9(@)). (11.96) 


Since À — 0 for |Z| > co, g and g must be homotopically equivalent, i.e. Uy is 
homotopically trivial (evidently, just switch A — 0). Hence a potential vacuum 
state |0} is given by 

|0) = bD |g(2)). (11.97) 

gEone homotopy class 

Without loss of generality, for the state |0) we choose the equivalence class 
corresponding to all gauge group elements that are homotopically trivial, i.e. 
in the same homotopy class as the constant, identity gauge transformation g =I 
and are generated by multiplication by Uy. 


|0) = X |g(2)). (11.98) 


g € trivial homotopy class 


But what if we define a different vacuum, obtained from |g(£)) 


|0) = n |g (2)) (11.99) 


gE homotopy class of g 


for some other gauge group element g which is not in the trivial homotopy 
class. g must go to identity at infinity. If g does not go to identity at infinity, 
and instead goes to some other constant gauge group element, go, then such 
a state is irrelevant. The matrix element between the state so defined and the 
state |0) must necessarily vanish since we must integrate over configurations, in 
Equation (11.87), which are spatially constant at infinity but change in time from 
I to go. Such configurations will have a non-zero A; over an infinite spatial volume 
(at infinity), for which the action is infinite and consequently the transition 
amplitude vanishes. Thus g defines an element of the homotopy classes that 


need not be the trivial class. Evidently the state |0) is also a vacuum state; the 
corresponding classical field configurations that we integrate over A; = g0;g~! in 
Equation (11.94), are all of zero energy. 

All gauge group elements that we are considering here satisfy lim)z|-,.. g(#) > 
I. Thus, topologically, all gauge group elements are defined on R? with the point 
at infinity identified, topologically $3. Each g(#) defines a mapping from S? > G, 
which fall into the homotopy classes of Ia (G) = Z. We can index the homotopy 
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classes by an integer n, 
g® = 
gO =e" ERP 
gi) = (0) l (11.100) 


Correspondingly, we can enumerate the classical vacua with the integer n 


In) = SS l9) (11.101) 


gof winding numbern 
and the winding number is given by the formula 


1 
24r? 


n= 


| Pret" (90:97 *gðjg t gðkg') : (11.102) 


If we denote by R, the operator that implements the gauge transformation g™), 
then 


Rin) =|n+1). (11.103) 


Note that g) £ eiT? with lim)z|+00 à > 0, hence gauge invariance does not 
impose that the vacuum state be invariant under action of R. But physically we 
would imagine that gauge invariance would require at least that 


RIY) = |v), (11.104) 


since we cannot physically measure an overall phase factor. A vacuum state that 
satisfies Equation (11.104) is called a theta vacuum, and is denoted by |0}. 

There is no physical principle that can predict 0. However, 0 must be time- 
independent since [H, R] =0. 0 must also be gauge-invariant. We say that 0 labels 
the superselection sectors of the Hilbert space and the Hamiltonian is diagonal, 
block by block, for each superselection sector indexed by 0. We can explicitly 
construct the state labelled by 0 by a simple Fourier sum 


Co 


l= So en). (11.105) 
n=—00 
These are the physical vacua of QCD, gauge invariant under trivial gauge 
transformations, and invariant up to an overall phase under topologically non- 
trivial gauge transformations. In the next section we will see how instantons give 
rise to these theta vacua. 


11.4.1 Instantons: Specifics 


In this section we will complete the specifics of the instanton solutions, some of 
which we have already used. The solution from Equation (11.75) is given by the 
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gauge field configuration for gauge group SU(2) (we will put all indices down for 


convenience) 
7 1 
AP = pe (Padus — £5 p0)- (11.106) 
The corresponding field strength is 
r? +2) 
FeS = qap zz Orave — Supôva). (11.107) 


Equivalently in matrix form 


2 —1 
& 1 1 
sz l Ə, (g! ) (11.108) 


Obviously, as r — œœ, the field strength vanishes quadratically as ~ 1/r?. 
Thus the action of the instanton is located in an essentially compact region of 
Euclidean spacetime. This was the reason for the name “instanton”; if we scan up 
through Euclidean time, there is nothing at the beginning, then, for an instant, 
the instantons turn on and off in a localized spatial region, and then, again, there 
is nothing. 

The instanton solution is rotationally invariant when compensated by a global 
gauge transformation. This is evident for the field strength F eee For the gauge 
field we must have the same. A rotation is defined by 


se Nel ty (11.109) 
with 
Ryv Rye = Ove, (11.110) 


since £0, Ryyt)Ryoto = Ryy Ryo) Lg. For infinitesimal transformations, we 
have Ruy = Oy + Apv, where A,» is infinitesimal. Then 


Tu > T, = Sy t Ap ey (11.111) 
and 
(On Aue) One F Mug) Sree hee + hoe = Ones (11.112) 
which requires 
Agu t+ Ave = 0 (11.113) 
or equivalently, Ag, = —A,.. Then the gauge field transforms as 
A, (2) > A,,(2") = Ap (£) + (Aus + ôpvAor£oðr)A (1) (11.114) 


thus the gauge field A, (x) = ee ae will transform as 


2 
$ 
Ap(£) > Al, (x) = A (£) + ape (Ayg0vg7' + (Nor toOrg) Ong * 
+90u(Acrt00rg *) — gAprâô-g7}) 


E ET (g(2")Oug7*(2')) (11.115) 
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using that r is invariant and g(x’) = g(x) + Aor£oðrg to first order. Then 
explicitly with g(x) = g'(x) = (x4 + izo’) 


g(a’) = = (0 


ch, fel a 

t in’o') + —Ag 20, (x* + ixio’). (11.116) 
r 

Then with Aja = Ài and Aij = €ijkAk We get 


1 
g(x’) = g(x) + zA (2104 — £40;)(x4 + ix;0;) 


1 ; 
+ Eijk UR TiO; 
1 f i 
=g(x) t Aile: i2401) + Z EijkAkT:O; 
1 z ; 
=g(x) +4 z (xa(— idii) + £il Ait iCijkTjak)) 
1 , . l 
ae (xa(1 — idigi) + @; (40; + Ai + icijkojak)) 
1 ; ; . 
= z0 — iyidi) (£4 +izjoj)(1+ibkok) 

(11.117) 
where A; = 7; — 6; and a; = 7; + bi, since, continuing the algebra to first order 
we get 

! 1 . ` i 
g(a) == = (wal = i(i — Bi)oi) + vito + 17950% — pjoo) 
1 . ; ; 
= 7 (wal tly — Bio) + xilioi + (yi — Bs) + icijk(Yj +83 )on) (11.118) 
confirming 


glx’) = (1 — iyioi)g(£)(1 + ibisi) =a" g(x)b, (11.119) 


where a = (1 + yisi) while b = (1 + i6;0;) to first order. Then from 
Equation (11.115) 


—— a7" gbð, (btg ta) =a7* A, (a)a. (11.120) 


Thus the solution is clearly invariant under an arbitrary choice of b. Thus the 
instanton is invariant under an arbitrary choice of b, but it is not invariant 
under a. 

This should give rise to three zero modes, it does, but in a slightly more 
complicated way. The important point is that rotations can be compensated by 
global gauge transformations. The rotation group SO(4) = SO4(3) x SO,(3) is 
explicitly broken to the SO,(3) subgroup. The instanton is invariant under the 
SO,(3) subgroup and it does not give rise to new solutions or equivalently to zero 
modes. In principle, this subgroup can be used to characterize the representations 
under which the physical states of the theory transform, exactly as, for example, 
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the invariance of a physical system under spatial rotations tells us that the 
physical states of the system must transform according to representations of the 
rotation group. The broken subgroup is SO,(3), which should give rise to new 
solutions and zero modes, but its transformation can be exactly compensated by 
a transformation of the group of global gauge transformation SO,i.(r), which is 
also SU(2) = SO,i.(3). Under a global gauge transformation 


A! (2) > hA! (2') h = ha™' A, (£) ah-* (11.121) 


hence h =a is a symmetry of the solution. Therefore, the symmetry group 
SOgi.(3) x SOa(3) is in fact broken to the diagonal subgroup SOq(3), 
corresponding to h = a, which remains a symmetry of the instantons. The 
anti-diagonal subgroup SOaa(3), with h = a7}, is broken by the instanton 
configuration, and gives rise to exactly three zero modes. Thus the rotation 
group is broken to SO,(3) 


SO(4) = 304(3) x SO,(3) > $0,(3), (11.122) 


while the group of global gauge transformations SU (2) = SOq.(3) is mixed with 
the rotation group 


S'Ogi.(3) x SOa(3) = SOa(3) x SOaal3) (11.123) 
and the diagonal subgroup remains an symmetry of the solution, while the anti- 


diagonal subgroup gives rise to three zero modes. 


11.4.2 Transitions Between Vacua 


The instantons are perfectly suited to describing quantum tunnelling transitions 
between the |n) vacua. The solution can be put into the gauge A4 = 0 by the 
straightforward gauge transformation A, —> h(A, +0,)h~! with 


h=P (e Eat, (11.124) 


Then at 7 =—oo the gauge field is 


Aj|r=—co =0 (11.125) 
but at T= co we have 
Ailr=oo = 90:97"; (11.126) 
where l S 
gee) Hr? (11.127) 


The gauge field is given in slightly different notation by ’t Hooft [112], 


a x 


Aq = 2a a Manes (11.128) 
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where ayy is the ’t Hooft tensor 
1 
Napy = E4apy + g CabeEbepy (11.129) 
with corresponding field strength 


2 
Pre, =4 


pv CESU (11.130) 


This configuration corresponds to a change in the winding number between 7 = 
—oo and T = œ 


<i T=00 
An= a? J Prex" (gðig *gðjg t gðkg t) 
1 T=00 
1 4 
= 39,2 d*20,G4 
1 
= zA pes (3 Gu — O:G:) (11.131) 


where G, was defined in Equation (11.27). The spatial components G; are, using 
A, = 0, 


2 : ; 
Gi ~ Eino (Au OA + z4nArAa) = €ipoApAy = Eijk Aj Ák, (11.132) 


which vanish as |Z| — oo since the electric field A, = Ep —> 0 so that the 
subtracted spatial divergence gives no contribution from the surface at infinity. 
Thus we find 


1 1 z 
An = T [erac, = gr [ate ee ee = h; (11.133) 


Thus the instanton mediates transitions with the vacua |0) > |1). 
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We will now consider quantum corrections that come from the Gaussian 
functional integral that should be performed about the instanton configuration. 
We must first extract the zero modes. For the single instanton, there exist, in 
fact, eight. In principle the amplitude (1Je~#70) is given by 


_1 pdtao.( £2 ay). 
(ie“# 0) = Z(T) =e f DQ) zi deo (agatat) ° (41.134) 


where Q; is the fluctuation that gives rise to the quantum corrections. The zero 
modes coming from translations and scale transformations can be eliminated by 
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using a Faddeev-Popov method. We insert one into the integral 
1= 5 fer (/ dtz (L(A8(2)) (a, — R,))) ie dA?) 
a( f ata(eras(a)y(e—Ry-»). (11.135) 


The delta functions choose the values of R and A as 


5t (fae (L(A2(2)) (ey — R,))) zy (([ atecvaztey) Gah -) 
_ 64(Ry— Ry) (Ry — Ry) 
- (fd £(Ag(a)))* 


with the obvious definition 
g _ Late (LAS) en) 
5 PECL) ` 


Furthermore, 


%2 = J dta L(Ag(x)) (2 — R)? 
fdaL(zx) 
and here R,, could be replaced with R, because of the first delta function. 


(11.139) 


Evidently, R depends on what gauge field A2 (x) is chosen: it should correspond 
to an instanton, and it contains the data on where the instanton is and its scale, 
à. Then starting with Z (T) slightly differently 


zT) =s} f ane) fate [ DQE S 2 
x5" ( fate (COQ). Ro) ) 8 ( f Eeee- R=»). 


(11.140) 
First we perform a translation and a conformal scaling 
Tu > T, =Atyt+ Ry, 
Qi(@v) > AQ? (E). (11.141) 


The action is invariant under a translation and conformal scaling (actually under 
all special conformal transformations) with 


L(x) = -gTr (3 Qu (2) — WQu(x) + [Qu (1) Q? (11.142) 


11.5 Instantons and Confinement 233 


we have 
d*xL(AQ(Ar — R)) = —A~4d*a'Tr (A0, AQ (2) — ADL AQ p(x") 
+)7[Q,(2"),Qu(2’)])° /Ag? 
= -d'a'Tr (8, Qu (2) — IQ ul) + Qula’),Qu(2")])” /49? 
= dtx' L(x") (11.143) 


and we will simply rename qz’ + x. Then we will change the functional integration 
variable 


Qi (Al + QI) (zy), (11.144) 
where A; is a classical field, an instanton solution. Expanding the action to 


second order in Q, (the first-order variation vanishes after one integration by 
parts, as Ar, satisfies the equations of motion), we get 


4 a 8 a 
an-s fao fen foge temou, 
xt ( f ata (C4, +Q) 8 ( f ael,- )]) 
=3f% a 3 SE fo gaye P e Q- L(At(2))-@ 


x6! ( foo, +Q,N(a,))) 5 ( [aca +e- 1)) l 
(11.145) 


Now we choose the instanton configuration to be centred on the origin and of 
unit scale size, i.e. 


| taceam f daea- =0. (11.146) 


Then expanding in the first-order Taylor expansion of the action in Q,, and using 
the notation D4 to be the covariant derivative with respect to Ma classical field 


Apu, Da- =0,-+[A>,] and the corresponding field strength F4, we have 
1 
L(A, +Q) = £(Av) — STr (F2.D2Q,) (11.147) 
g 


we get in the delta functions 


ôt (aes Tr (FA Droa) =ô (fae Grr (FA Q») (11.148) 


and 


a( fates Tr (FA DEQ; (2? -1)) = a(- 2 fda — Tr (Ff Qro) 


(11.149) 
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Thus the delta functions impose the conditions 
farr (Fi Q+) = 
paete (zo Ff-Q-)=0. (11.150) 


But these conditions are exactly the conditions that the quantum fluctuation 
Qu be orthogonal to the zero modes corresponding to translations and scale 
transformations, respectively. Indeed, for translations we simply transform the 
classical solution A, by the broken symmetry, translation in the x, direction 
generated by a simple partial derivative in that direction. However, this is 
not gauge-invariant, hence we also perform a gauge transformation, ôs A, = 
—DA4(A,), for an infinitesimal gauge transformation, to give 


tr. = 9,A,—DA(A,) = FA. (11.151) 
The normalized zero mode is qt": = Teaver tr. with N*™ defined by 
=s J d4xTr (ptr) 2=- f dtz pe Tr (FAFA) 
o fixed 
sam (FA FA) =9°S0. (11.152) 


We wish to keep track of powers of g and hence we note that the normalization 
factor does not have any powers of g since Sp ~ 1/g?. For the scale transformation, 
the infinitesimal generator is (1+ .2,0,)A, and then the infinitesimal variation 
of the gauge field, made gauge-invariant, gives the zero mode 


ws” =(1+2,0,)A; — DA (tg Ac) = to FA. (11.153) 


A simple analysis also shows that the normalized zero mode, pee = se / VNE, 
will not have have any net powers of g. 

The delta functions impose that the integration over Q should be restricted to 
the function space that is orthogonal to these zero modes. But in writing Q in a 
sum over normal modes there is a subtlety involved. We should add a factor of 
g in the expansion 


Q,=Ct"g Fen ig ar ie DE, (11.154) 
Tne g ; 


where ws are the non-zero modes. The integration measure is the usual infinite 
product 


dC? Cs ACs 
D2) =e vn l (11.155) 
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as long as the same conventions are followed in the numerator and the 
denominator for the functional integral. The reason for the extra factor of g comes 
from the exponent in the integrand. We examine the exponent more carefully, 


fee; Q- on L(A%(x)): Q=- /eerr(Q, (D2 D} — DE Do Suv — Fi.) Q). 

(11.156) 
The using the expansion in Equation (11.154) with its extra factor of g, and 
where oe explicitly are the normalized non-zero eigenfunctions of the operator 
of the second-order variation (DiDi — Do DA by, — FA) with eigenvalues €¢, 
we get 


pers =O: ae (Aa AD (c8)? (11.157) 


Notice that €¢ contains no powers of g, nor does he determinant that ensues 
after integrating over the CS and the zero modes simply drop out in the exponent 
and the delta functions control the integration over their coefficients. Then the 
delta function for translations gives 


A tr. 4 
"(3 [iarta 7E a)-e (E of | =( £ ) (C), 
g’ g Ta 
(11.158) 


The delta function for scale transformation also gives a factor of INT giving 


an overall factor of g5. Thus writing a prime as usual to indicate the restriction, 


we get 
20) =< Sho) om ee |S Aa] 
(IN (VN 
Di(Qaje L078? saa Ahr (11.159) 
with 
s=% (11.160) 


The overall power of g is then g~° due to the five zero modes that we have 
treated. There are in fact more zero modes associated with the global gauge 
transformations and the rotation group [112, 98, 24, 12, 65]. As we have seen for 
SU(2), the diagonal subgroup of these two is unbroken, but the anti-diagonal 
subgroup is broken with three broken generators. We will not analyse these 
explicitly, it will suffice to say that they give exactly another power of g~°, 
for the gauge group SU(2) giving the total, overall factor of g to be g8. In 
the case of interest, QCD, the gauge group is SU(3) with eight generators. We 
imagine putting the instanton solution in an SU(2) subgroup of SU(3), but 
then one generator always commutes with the SU(2) subgroup. For example, if 
we generate the subgroup with Aj,A2,A3, then Ag commutes with these three 
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matrices. Thus we find there are only seven broken generators. The SU(2) 
subgroup mixes in an identical way with the rotation group as in the case when 
the entire group was SU(2). Thus the upshot is there are seven additional zero 


modes, which give a factor of g~* ate 


and an overall factor of g 
We have computed the contribution of a single instanton to the transition 
between the two vacua |0) and |1). As usual, multi-instanton configurations are 


negligible except for those corresponding to well-separated single instantons. 


nt+tn 
Z(|0)  |1)) -È ee afe S P EK) 


n+n 
g -ir 
-È Ôn-n,1 ae rls yee K) (11.161) 


where K is the determinantal factor including various other normalization factors 
and constants independent of g. This result directly generalizes to the amplitude 


and finally to 
Z(0) = (le #7718) 


l 
®, 
F 
| 
è 
A 
3 
J 
2 


II 


= a) ee 
( 5 Je (2esovr ee K). (11.163) 
The infinite, constant prefactor is simply a consequence of the plane wave 
normalization of the theta vacuum states. 

Then from Equation (11.65) and Equation (11.66) we get the energy of the 
ground state 


dN _ 8% 
(0) /V =—cos6 | 559 e ? K. (11.164) 
0 


We have purposely left the g-dependent factors inside the integration over 
A for a reason, and we have absorbed all constant factors into K. This is 
because evaluation of the determinants requires renormalization of the coupling 
constant, and renormalization inserts a scale dependence into g and K. The 
infinite product of eigenvalues of the operator corresponding to the second 
variation of the action in the presence of the instanton, Equation (11.156), 
is not rendered finite when divided by the same infinite product but in the 
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absence of the instanton. We have to add counterterms with infinite coefficients 
so that the divergences are absorbed. Adding the counterterm proportional to 
~ FF, exactly renormalizes the value of the coupling constant g. However, 
the renormalization inserts a dimensionful mass scale, M, into the theory, which 
fixes the physical, finite, observed value of the coupling constant at that scale. 
The coupling constant obeys the equation 


arene! 11 
P(A) P(M) 8r? 


where the dependence comes from the simple fact that g is a dimensionless 


In(AM) +0(9°), (11.165) 


coupling constant, since the only dimensionful parameter that exists, apart from 
the renormalization scale M, is the instantons scale A, the two must come 
together in the dimensionless combination AM. The factor -4 is the famous 
result of asymptotic freedom for the beta function of QCD, which is a long, 
hard calculation in perturbation theory |57, 102], which we will not describe 
here. Asymptotic freedom means that as the scale À gets smaller \< 1/M, and 
the instanton size goes to zero, the coupling constant g?() becomes smaller as 
IndAM is negative and the right-hand side becomes larger. Indeed, replacing the 


solution Equation (11.165) in the expression for the energy gives 


E(6)/V 
= of S ELE rao e e 
= — COS g 5 JM) Bae n e 
872 


dA ee -8> a 
= —cos6 f 559 2(M)e #4) (1+0(g?(M)n(AM)))et POM K(AM) 


872 


= ~cos0 | AMM" g-?(M)e 7D (1+0(g?(M)n(\M))) K(AM). 
(11.166) 


Thus for small A, in the ultraviolet, the integral is perfectly convergent; however, 
in the infrared, as A — oo the integral is obviously divergent. Thus the integral is 
well-behaved in the region where we trust our calculations, when g — 0, but does 
not make sense in the regions where g > 1, where we do not trust our calculations. 
Indeed, we expect new, non-perturbative (not instanton effects, which are also 
non-perturbative but only valid for small g) strong coupling effects to kick in as 
g becomes large, effects which we have made no pretence to be able to compute. 
Thus we stop the calculation at this point, content with the expectation that 
large coupling, confinement-related effects cure the behaviour of this integral. 


11.6 Quarks in QCD 


We will next consider the question of quarks in QCD. The quarks come in the 
fundamental representation of SU(3), which is generated exactly by the 3 x 3 
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Gell-Mann matrices of Equation (11.6) and in six flavours, up, down, strange, 
charm, top and bottom, which we will denote by a label a, and correspond to 
Dirac fields w(x). The colour index is suppressed but takes on three values, 
1,2 and 3, thus the Dirac field is a three-component column for each flavour 
index. The Lagrangian density in Minkowski spacetime is then given by 


L= aTr Fala) w) FH" (x +d (Y° (a)iy" (Op + Au) (2) — M° Y (a) h(a), 


(11.167) 
where the gauge field is a 3 x 3 anti-hermitean matrix, A, = iA Aa Fav =A — 
0, A, +[A,,, A] is also anti-hermitean. The y“ are the usual Dirac matrices. The 
masses m“ are quite small for the up and down quarks, less than 10 MeV. 
Thus the massless limit is a reasonably good approximation when considering 
processes that largely imply only the up and down quarks. This limit has a 
higher symmetry, called chiral symmetry which is spontaneously broken, and 
can be treated with chiral perturbation theory. The strange quark mass is a 
little more, around 95 MeV, but still within the purview chiral symmetry and 
chiral perturbation theory. The charm mass is about 1.3 GeV, the bottom mass 
is 4.2 GeV, and the top mass is 173 GeV. Neglecting these masses is not a good 
approximation. In what follows, we will restrict our considerations to the up and 
down quarks and neglect their masses, which is a rather good approximation. 
Then the fermionic part of the Lagrangian density is 


L= Y Pai + Av"). (11.168) 


a=up,down 


The Lagrangian in this case has a symmetry SUz,(2) x SUp(2), called 
chiral symmetry. The subscripts L and R correspond to independent SU (2) 
transformations on the left-handed and right-handed components of the Dirac 
spinor. If we write the spinor fields as 


E bu 
»={ is (11.169) 


then the chiral transformation corresponds to 


py fe") an AP an ay, (11.170) 


where z, r are independent parameters of the two SU(2) transformations and 


the č are the Pauli matrices. The chiral projection operators, 145 , project onto 


the left-handed and right-handed components of the Dirac KT 


v=titue=(2)u (2) u (11.171) 
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In the chiral representation of the Dirac matrices, 


10 0 0 
01 0 0 
= 11.172 
Bloo -1 o0 ( ) 
00 0 -i 
so that 
0000 10 0 0 
1— 1 1 
5 _ 0 0 0 0 t% 0 0 0 (11.173) 
2 0010 2 0000 
000 1 000 0 
The Lagrangian Equation (11.169) is also invariant under two U(1) symmetries, 


Uy (1) x Ua(1), Per 
ap — e259 y (11.174) 


for independent parameters 0 and 6’. The corresponding conserved currents, via 
Noether’s theorem, are denoted 


ado and jE = Yyy. (11.175) 


The chiral symmetry group SUz(2) x SUr(2) is spontaneously broken to the 
diagonal subgroup SUp(2), which is identified as the isospin group, in this 
case with just two flavours, up and down. Due to this spontaneous symmetry- 
breaking, the Goldstone theorem [56] implies the existence of three Goldstone 
bosons, massless scalar fields, which are then identified with the pions. The 
pions are not massless, however; the mass terms for the up and down quarks 
softly but explictly break the chiral symmetry. The consequence of this explict 
breaking is to give the putative Goldstone bosons, the pions, a small mass. 
This analysis is called chiral perturbation theory [117]. The Uy(1) symmetry 
corresponds to the baryonic charge and is presumed to be conserved. The one 
question that remains is what happens to the U,(1), how does this symmetry 
manifest itself? If it is not broken, spontaneously or explicitly, then it should be 
associated with a conserved quantum number. We do not see any such additional 
conserved quantum number. If it is spontaneously broken, then we should see 
another corresponding massless Goldstone boson. It can be shown that this does 
not correspond to the 7, Weinberg has shown [118] that the mass of such a 
putative Goldstone bosons must satisfy the inequality mg.p. < Vm,- This lack 
of understanding of how the U4 (1) symmetry manifests itself is called the U (1) 
problem. 

The U(1) problem is related to instantons, the theta vacua and the chiral 
anomaly, which we will explain in this section and the next. The upshot is that 
the U,4(1) symmetry is actually explicitly broken, due to a quantum effect, called 
the chiral anomaly. To understand the chiral anomaly it is easiest to work in a 
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function integral formulation for the fermionic fields, the subject to which we 
will now turn. 


11.6.1 Quantum Fermi Fields 


Canonical quantization of fermionic fields demands that the fermions satisfy 
equal time anti-commutation relations 


{Y(T t) iv! (g, t} = ind? (T -— 9), (11.176) 


where {A,B} = AB + BA. Why does the anti-commutator arise? For a free field 
with equation of motion 


(ið, —m)wv =0 (11.177) 

we can construct the solution by simple Fourier transformation 
p= ls ba(k)ue(k)e"*** + dl (k)u%(k)e**) , (11.178 
Sia main Z Coben (kyu (k)e"**), (11.178) 


where k = ((k2 + m?)!/2,k), u%(k) and v%(k) are specific, orthonormalized 
spinor solutions of the Dirac equation (11.177) of positive and negative energy, 
respectively, while b,(k) and d'(k) are arbitrary, operator valued coefficients. 
The expression for the Hamiltonian (energy) then becomes 


H = 


3 => 
S (K? +m?) X (b) (k)ba(k) — da(k)dl,(k)) , (11.179) 


where we have not changed the order of the operators in the expression for the 
Hamiltonian. The order of the da and the df, occurs because we expanded y 
with di, rather than da but the minus sign occurs because the v®s correspond 
to negative energy solutions of the Dirac equation. If we had used da in the 
expansion of ~ we would have arrived at the expression in Equation (11.179) 
with the da and the d!, interchanged; however, the minus sign would still be 
there. Now if we want to have H > 0, up to a constant, we need 


da(k)dl,(k) = -dt (k)da(k) +1, (11.180) 


where we have chosen the constant to be 1 for the case of discrete k. For a 


continuum of k’s we get 
{da(k), di,(k’)} = 54 (k — k’) (11.181) 


and the Hamiltonian, up to a constant (which can very well be an infinite 
constant!) is 


3 
H= | g +m)? D (balk) + di (k)da(k)) (11.182) 


a 


a positive semi-definite form. As the bas and the das are equivalent, we must 
choose anti-commutation relations for both. 
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11.6.2 Fermionic Functional Integral 


The limit 4 > 0 in the canonical anti-commutation relations Equation (11.176) 
does not yield ordinary, commuting c-number fields in the classical limit. The 
fields become anti-commuting fields, so-called Grassmann number-valued fields 
whose anti-commutator, rather than commutator, vanishes 


{Vz t) p gt} =0. (11.183) 


Thus the classical limit gives fields that are elements of an infinite dimensional 
Grassmann algebra from an infinite dimensional Clifford algebra in the quantum 
domain. Then, if there is to be a Feynman path-integral description of fermions, 
the integral should be defined over the classical space of fields, fields that 
are Grassmann algebra-valued. Such an integral can be formally defined. For 
free theories, perhaps all such formalism is rather unnecessary. However, for 
interacting theories of fermions, the functional integral description must at least 
be able to generate the perturbative expansion. In fact, we can almost think that 
the fermionic functional integral representation for the amplitudes of a quantum 
field theory with fermions is simply a very compact and efficient notation that 
can and does serve as a means of generating the perturbative expansion. 

Abstractly, an integral is a linear map that takes a space of functions to the 
real numbers. We will define the functional integral over a Grassmann number 
in this way, first for a finite set of Grassmann numbers, and then generalize to 
the infinite limit. A Grassmann number 0 satisfies 


{0,0} = 00 + 00 = 200 = 20° =0. (11.184) 


We define a differential operator a by the very reasonable rules for any other 
anti-commuting number ( and for a c-number a, 


d 
dé 


The derivative operator should be thought of as a Grassmann-valued operator; 


d d 
@=1, g= p= (11.185) 


it should anti-commute with other Grassmann numbers. A general function of 
f(0), i.e. a commutative function, can be expanded in two terms 


f(0) =a+ p0, (11.186) 


where a is real while 3 is Grassmannian. Then 


d d 


d d 
apf (0) = pet ah = — 08 = —B. (11.187) 


The idea that f is a commutative function means that it is composed of an even 


number of Grassmann numbers, 0 and 2 in this case. 3 is a Grassmann number, 
hence 
B =0, {8,0}=0 (11.188) 
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then it is easy to verify 


[f, 8] =[f,6] =0. (11.189) 
Then clearly 

dd @ 

db ae (11.190) 


This means that the integral can in no way be the inverse of differentiation, the 
derivative is a nilpotent operator. However, we will define it, following Berezin 
[13] to be a linear map from the space of Grassmann numbers to the real numbers, 
we define 


[ei=o ee (11.191) 
which implies 
[910 = | a6(a+56)=0+ f 4939=~ f 4996 =~. (11.192) 


For N Grassmann numbers we have the algebra 


{6;,0;} =0 
d 
(so) 
d 
lir (11.193) 
a J 


for i,j =1,---N. Then a general, commutative function is expanded as 
f(9;) =a + C0; + ¢459;0; +--+ +.00102---ON (11.194) 


and we notice it has a finite number of terms. Cijki-. is Grassmannian if the 
number of indices is odd but a real number if the number of indices is even. The 
integration rules generalize as 


fæn=o, fos; = bi; (11.195) 


and by convention and consistency 


fnd; 6102 = [em (J ao) M= - f% 6, =-1. (11.196) 


Then it is easy to see for an anti-symmetric matrix M (clearly any symmetric 
part of M will not contribute) 


"M; -0 2N/2 /det(M), for Neven 
In(M)= | dO,---d0ye7 Dis M995 — 
n(M) | ee 0 for Nodd 


(11.197) 
For an invertible, anti-symmetric M and a set of Grassmann parameters Xi, i = 
1---N and {xi Xj} = {xi,9;} =0, we can compute 


Tyy(Mix) = | dB -+-dBqye° Eu Myt (11.198) 
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as follows. Translating the integration variable 0; = 04 — 5M, eer we get, using 
matrix notation 


T 
67 M6—y70= (#— 5M -1x) a (e—5M tx) =x r(e- aM tx) 
1 1 
= 6’? Me’ i= SOTM Mo iis oT) 4 ax x 


1 
say? Of 4 ie 


1 
a 6’? Me’ 5 x70’ x76’) 
1 — 
mie Oh + ae at ty 
1 
=6'T M6’ + gx x (11.199) 


using M-!" M = (M?M-1)? = (-—MM-1)? = -IT = —I since M is anti- 
symmetric and the fact that the x is also anti-commuting. Then 


n(M;x) ea Mijo- 4xiMT txj) 


N/2 -Zij bam txj 
2/44 detM j4 J, for Neven _ (11.200) 
0 for Nodd 


For complex fields, we have the equivalent of complex Grassmann numbers 
_ 9, +762 «9-102 
V2 3 n v2 


Considering the 2 x 2 case, we impose ere 0;Mi;0; = in*Mn which gives M= 


(11.201) 


2Mı> and we have 


J dn* dne” ™ = det(M), (11.202) 


where the integration is done by treating 7 and 7* as completely independent 
Grassmann variables. Dropping the tilde, the integration formula generalizes as 


J [|dan de® = det(M) (11.203) 
i,j 
and with sources, suppressing the indices and summation signs, 
J [ [ant dnei Mne" ntin"t = det(M)e EME, (11.204) 


Then boldly generalizing to infinite dimensional integrals we get for the fermionic 
field 


Souper Paver? oem = det(iq”ð, —m) (11.205) 
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and including sources 


I Dh, pet f trti ou -mb+E EY — det(iyt ð, — meS rei aum) e 


si doia 
=N'e fd PCCP) gm SCP) (11.206) 


Then for a general gauge interaction the usual perturbative expansion ensues 
from the coupling (where we have expressly put the coupling constant e) 


L' =iepy Any (11.207) 
then 


Z(¢,A)= | DG pie erna 
iS ata stazi An giay) iS dote) pn) 
=N'e ôç (zx) 6C(ax) e pC(p a p l (11.208) 


The derivatives with respect to Ç(x) and ¢(a) can be trivially converted into 

derivatives with respect to ¢(p) and Ç(p) by Fourier transformation. This gives 

rise to the usual perturbation expansion expressed in Feynman diagrams [101]. 
Reverting back to Euclidean space, the action is 


Se= - f toiling’, — im), (11.209) 
where the y, matrices satisfy the Clifford algebra 


{Yu Ww} = 25yr- (11.210) 


The fields 7) and w are no longer related to each other, but are in fact completely 
independent Grassmann-valued fields. We can infer this from many points of 
view. First, and most importantly, if the formula Equation (11.203) is to work, 
the integration variables 7 and 7* are completely independent. First of all, 7 and 
n* satisfy 

{misma} = {ni nj} = {mim} =9- (11.211) 
Then if 7* were the adjoint of n, i.e. n* = nC, where C is a fixed matrix akin 
to a charge conjugation matrix, then the last relation would imply (multiplying 
by C7!) and contracting together 


X (mni + nim) = 0. (11.212) 
This says that the sum of two positive operators vanishes, requiring the operators 
to be zero. Additionally, the Euclidean Dirac fields transform according to the 
(5,0) @ (0,4) representation of the four-dimensional Euclidean rotation group 
SO(4) = SU(2) x SU(2). The two SU(2) subgroups are totally independent of 
one another, hermitean conjugation does not take one into the other, as is the 


case in Minkowski spacetime. 
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We decompose 7 and w with a complete set of orthonormal spinor solutions 
of the Dirac equation 


b=> arb, p=) andr, (11.213) 
where the coefficients a, and G, are independent Grassmann numbers and 
[atoviv.= | tag = ns (11.214) 
Then we define the functional integration measure as 
D(b,b) = | [ da-dā,. (11.215) 
Then 
Sg = - f daili, = -X Apres (11.216) 
where Í 
(Yuba) Pr = Arr. (11.217) 


Then the integral 
[Dudes = [To daedacc® are = [ [> =det(i,,0,). (11.218) 


In the massless limit, m — 0, the action Equation (11.209) is invariant under 
global chiral transformations, decomposed as vector and axial transformations 


pa eitn G Gete), (11.219) 


The chiral anomaly corresponds to the fact that it is impossible to define the 
functional integral while simultaneously keeping the axial gauge symmetry and 
the vector gauge symmetry. The full chiral symmetry of two-flavour QCD is 


SUy (2) x SUA (2) x SUy (1) x UA(1), (11.220) 


but the SU4(2) is spontaneously broken, giving rise to three massless Goldstone 
bosons, the pions, 


SUy (2) x SU4(2) x Uy (1) x UA (1) > SUy (2) x Uy (1) x UA (1). (11.221) 


The anomaly results from the impossibility to preserve the remaining chiral 
symmetry in the quantum theory. Fundamentally, the anomaly results because 
of divergences in the naive, original theory. Then to make sense of the theory 
these divergences must be removed; this is done in a rather brutal fashion and 
is called regularization. The brutality of the regularization means that it seems 
necessary to explicitly break at least some of the chiral symmetry of the original 
Lagrangian. Indeed, there is no known regularization that can preserve all of 


246 Quantum Chromodynamics (QCD) 


the chiral symmetry and it is understood that no such regularization exists. The 
hope and expectation was that once the regularization is removed, the full chiral 
symmetry of the theory would return. The anomaly corresponds to the fact that 
this is not the case. In fact, upon removing the regularization it is not possible 
to preserve both the Uy(1) and the U,4(1) symmetries. 


11.6.3 The Axial Anomaly 


Conceptually, the clearest method for seeing this was discovered by Fujikawa 
[50|°. He considered the fermionic functional integral 


T= | Dippet Eip (11.222) 


and realized that the anomaly comes from the inability to define the functional 
integration measure in a manner that is invariant under all the chiral 
transformations. Here we generalize further, allowing the covariant derivative to 
include gauge fields, in principle, for all the global symmetries. However, we will 
find that some global symmetries are not preserved in the quantum theory, and 
then adding the gauge fields corresponding to those symmetries is inopportune. 
Their quantization makes no sense as renormalizability requires gauge-invariance. 
Thus we imagine adding gauge fields for all symmetries that can be preserved 
at the quantum level. For the case of QCD, this corresponds to gauge fields for 
the colour gauge symmetry SU.(3) and the Uy (1) symmetry. Gauging the chiral 
SUy (2) x SU4(2) actually corresponds to part of the gauge group of the weak 
interactions, but we shall not develop this theory here. We will expand the fields 
slightly differently from Equation (11.213) as 


p= ayr p=) gla, (11.223) 

with 
iDpr =v Pr / daphips = brs (11.224) 

and 
D(b,b) = | [ dada... (11.225) 


For a local axial transformation y(x) > e#?5y)(x) © W(x) +1B(a)75U(x) and 
W(x) > plr) x h(x) +18(x)d)(x)75 the Lagrangian is not invariant 


L(x) > L(x) — (Op 8(x)) 02) I5¥(2)- (11.226) 


However, 


d(x) 4 P(e) =) apre) =X ape? p, (2). (11.227) 


3 We note that Fujikawa used anti-hermitean Euclidean Dirac matrices. We stick with 
hermitean Euclidean Dirac matrices. 
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Then 
al, = =E | daei eP ps (x)as ja= DOr ss (11.228) 


and 


J] [ da, = (detc)-" | | das. (11.229) 


Interestingly, the power of the determinant is —1. This is because the Grassmann 
integration actually behaves a lot like differentiation. Indeed, 


1= f aQa) (Aa) = f da z(a) = JA | da=), (11.230) 


thus J = 1/X and d(àa) = da/à. The determinant of the matrix C,, for an 
infinitesimal transformation is 


= det(C;.)~! det (inti fateptaretteynvels)) 


= exp (<tr (a. + f eetos) 


= exp (-i f ta E etne) (11.231) 


using the expansion of ln(1 +e) ~ e. We must not forget that an equal 
contribution will come from the variation of [],.da,. We wish to evaluate 
A(x) = >, yh (x)75¢r(x); however, the sum is surely hopelessly divergent. We 
regularize it with the eigenvalues of the Dirac operator, taking 


A(z) = im ee (2)yse7Or/M)? oy, (2). (11.232) 


It is this choice of regulator that puts the anomaly in the axial symmetry, 
preserving the vector symmetry in the quantum theory. Another choice can 
preserve the axial symmetry but not the vector. We can choose which symmetry 
we wish to preserve; however, we cannot preserve both. Writing y,(x) = (2|r), 
(note the ket |r) must span the matrix indices of the coordinate wave function 
y(x)) we have 


. = 2 . cy 2 
AC) = in Yerle MP (alr) = im DOTr (a5 ele“ Or! ry (re) 


= lim Sa s (zle P/M)" |p )(rlx)) = lim lim Tr (ys (zle? |y)) 


M- oo Moxy 


= ji lim T jenn? f dtk |k) (k| ) 
= lim lim Tr | y5(a Qn y 


M—œxt>y 


k 
= lim lim Tr (se~ GØ @)/m)? ( TE- Elk) 


M—=>œzt—>y 


dk 
= lim lim Tr Gr eT ÜPE)/M)? pik-s omik: ") (11.233) 


Moo xr-y Ini” 
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and we should be aware that the Tr is over Dirac and internal indices. Now 


iD(x)e*** = ett *(_k +iP(x)), thus 


A(x) = lim lim Tr (f r)i A o 


Mowry 


= lim Tr vs f atk e7 ((- B+ 4D (a))/M)? 
M->0o (27)4 
= lim Tr vs f dk é (Lime hy budy kp tiDy(x))(—ky+iDv (x))/M? 
M-+00 (2Q7)4 
dk — Suu + MW) (hy +iDy(@))(— ky +iDv (2) /M? 
Ee (rs f Gare yo weet 
; dtk ( k+4D(a))?— Dew] Te ) ym? 
z (» J Ori" 
; dtk —k?/M? „(2ik-D(æ)+D?”(z)+}[ |Fuv)/M? 
= lim Tr y5 e e Flu: WF uv 
M->co (27)4 
‘ dtk 12 ),72 1/1 2 
= lim Tr UE: nee (1445 (Hoenn) pata). 


(11.234) 


The first term in the expansion of the exponential that survives the Dirac trace 
is shown and, although there will be other terms in the higher orders that survive 
this trace, they will have higher powers of M in the denominator. The Gaussian 
integral only gives a factor of M4, hence in the limit M —> oo this is the only 
term that survives. Thus we get 


k d*k 1 1 1 1 -painy 
A(x) = lim J (27)4 T( CEASA) FuFar) e™¥ 1M : 


M-+o0o 2 M4 

(11.235) 

Using that Tr(ysYpWwIo7r) = 4€uvor and that the Gaussian integral is 

Ck Sea Ie 
PRUNE 11.2 
f Or 16r? (11:296) 
gives 

(2) = zawor Tr (Eyfa). (11.237) 


Therefore, the fermionic functional integration measure is not invariant under 
axial transformations and transforms as 


D(y, p) + Dah, Gje Tor? FEB envor Tr (Fav For) (11.238) 


where we get twice the variation since both 7 and w contribute to the measure. 
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11.6.4 The U(1) Problem 


With the chiral anomaly, we understand that, at the quantum level, we cannot 
preserve all of the classical symmetry of the action. Only the subgroup U,(1) 
is explicitly broken by the chiral anomaly and the remaining subgroup is 
spontaneously broken SUy (2) x SU4(2) x Uy (1) x SU.(3) > SUy (2) x Uy (1) x 
SU,(3). Under a local axial U,(1) transformation, — Sg, minus the action (that 
appears in the exponent) transforms as, keeping only terms to first order, 


/ dx (iyu Du) > J dab (1 + iB (0)95) (iy Du) (1+ 18 (0) 950 
= [20 ud.) 6- Oubli: (11.239) 


The functional integral under a change of variables must be invariant. If we 
transform to the field Y’ = (1 + ib (x)ys)ẹY, we get 


E / DOT / D(a! Pel EPD- (8) burs 
> J Diop, Gje! EPO (geewenver Tr Fun For) +i drys) of datih 


(11.240) 


Then invariance requires 


1 C 
Ould p = "Tex 2 = a Ewor lr (FuvFor) = ts (Fu Fv); (11.241) 


87? 
where the matrix element on the left-hand side signifies the fermionic expectation 
value of the axial current operator Ops in the presence of the background 
gauge fields. The latter equality is easily obtained for an arbitrary multiplet of 
fermions in a representation with hermitean generators T* of SU(n), and then 
C is the constant in Tr (T°T*) = Cd*. The i on the right-hand side is expected 
and disappears upon Wick rotation back to Minkowski space. 

We can demonstrate the so-called chiral Ward—Takahashi identities, which 
have to do with symmetries, and will be useful in our analysis later. Consider 
the m point function 


_ [D pe $2 g (21): e (£m) 
{Db pes i 


where the ¢'(a;) are local, multi-linear functions of the fermionic fields where Sg 


(H (x1) 0 (@m))4 = (11.242) 


is given in Equation (11.209). With the variations (taken in the opposite sense 
to Fujikawa as in Equation (11.239), to stay with the conventions of Coleman) 


ôy =—iysvda(x) dy = —iby56a(x) (11.243) 


we have 


ôg = Fa OU): (11.244) 
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For example, 
d(by) = —2ipyspda(z). (11.245) 


But changing the variables in the functional integral must not make a difference, 
it must be invariant. Thus we get 


0= (p (21) H” (@m))* = (09B (21) O (Em)) 
+ (S686) -4em)). (11.246) 
Then, since 


— OSE = [dbx bing(—iO,.8a)y5b = | d'e (G,50) jon =- | aerda jig 


(11.247) 
we get 
0 = (bjo) (1) Em) +a — 27) e PE) g(a) 
—2M (h(x) 750(x)$" (21): 7 (2m); (11.248) 


where the last term is there if we add a mass term that breaks the chiral symme- 


try explicitly and we will write j5(x) = ¢(x)y5¢)(x). Then using Equation (11.241) 
and integrating over x, we get 


ou ( eio een) 


ð a i 
= gg C 01) (@m))4 
-5 d'z (FuvFuv) (o! (x1) -G7 (Em) 


But what is the effect of the fermions on the instanton? The instantons must 
still be solutions of the equations of motion 


DuFw=jv with Dyj,=0 and i7,Dyb=0. (11.250) 


These equations have a perfectly good solution, 7) =0 and D, Fy, =0. The latter 
equation is satisfied by the instantons’ and hence the instatons’ configuration is 
unchanged by the fermions. All the previously found formulae must still be valid, 


E(0)/V = —2K cose °° (11.251) 


and 
(O|FF|0) = —6417iK e~™ sin. (11.252) 
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The only change that occurs is that the Gaussian integral about the instanton 
configuration over the gauge fields is appended with a functional integral over 
the fermion fields (which is in a sense also Gaussian as the fermion fields only 
enter quadratically) in the presence of the instanton background 


det(i7,(O. + Ay)) 
det (iy (O,:)) l 


The consequences of this change are profound. The fermionic determinant in the 


K>K (11.253) 


presence of the instanton vanishes exactly, giving 
E(0)/V=0 (6|FF|6) =0. (11.254) 


This means that all the theta vacua become degenerate in energy, that the 
U,(1) symmetry is spontaneously broken. The U(1) problem corresponds to the 
question, “Why is there no corresponding massless Goldstone boson?” What we 
will find is that the massless boson never contributes to gauge-invariant matrix 
elements and therefore is not physically manifested. 

Why does the fermion determinant vanish? It is because in the presence of 
an instanton, there is necessarily a zero energy mode to the Dirac equation. 
Evidently 


[asad = f aoao1 =0, (11.255) 


which is quite unlike the bosonic case 


[ave = | dp1 =c0(= D. (11.256) 


The zero mode follows from a deep theorem, the Atiyah-Singer index theorem 
[8]. However, we can quite easily establish the existence of the zero mode directly 
using the simplest chiral Ward-Takahashi relation. We will work with an SU (2) 
gauge group with one doublet of fermions for simplicity. The Dirac equation for 
eigenmode of energy A, is 


iD, = ArWr (11.257) 
but then 
iD-y5 0b, > — 715i Dy, = —Ar Y5 Yr. (11.258) 


Thus for each mode ~, of energy A, there is a matching eigenmode 75, of 
energy — àr. But what happens if A, =0? Let y? be a zero mode, iy? =0, but 
then obviously iJPy5v2 = 0. We can choose the zero mode to be an eigenmode of 
qs: with pot = 275 Y? we have gay? = y+ with iPy?+ =0 and 4% yF = 
0. The eigenvalue of y5 of the zero mode is called its chirality, which we will 


call x, for zero mode 2. We do not know if 2+ =0 or perhaps y? = 0, or 
possibly neither vanishes (in which case there are two zero modes, of chirality 
+1, respectively); however, both cannot vanish if 72+ +y~2- = y? 40. 
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Let n} be the number of zero modes with positive chirality and n_ be the 
number of zero modes with negative chirality. The Atiyah—Singer index theorem 
states that ny —n_ =v. We can prove this theorem using the chiral Ward- 
Takahashi identities. Consider the simplest identity, without any fields ¢", for a 
single doublet of fermions in SU(2), C = 1/2, we have, 


SDP f deps 
[DU d)eSe 


with Sz = f diry (ip — iM) w and the solutions of the Dirac equation are 
unchanged from the massless case, only the energy eigenvalues are shifted 


—2iv = 2M / de lyy) =2M (11.259) 


i(D— Mop = (Ar — iM Yr. (11.260) 
Clearly, 
i(D — M)y5%r = (—Ar — iM) 750, (11.261) 


but the eigenmodes Yy and y5v, must be orthogonal if \, 40 as they are actually 
eigenmodes of the hermitean operator iJ). We observe 


i, dtrpsysys=0 if A, #0, (11.262) 
but for the zero modes 
/ diry = Xs, (11.263) 
thus 
Jirin = X. xsbsas. (11.264) 
s8,A\s=0 


The Y, are a complete and orthonormal basis of the space of fermion fields, hence 
we can write 


b= arr =X yib, (11.265) 


with Grassmann coefficients a, and br. Then the functional integral is given by 


SIL. dardb, elirAr—iM)brar f dtrpysyp 


-2iv = 2M LO it) 
_ 2 Barcel iM)Xs (11.266) 


as the fermionic integral gives (A, — iM) for all the non-zero modes but a factor 
of 1 for the zero mode in the sum ois d-=0 Xsbsas. The infinite product cancels 
between numerator and denominator for all the non-zero modes, and therefore 
the chiral Ward identity gives 


— 2iv = 25° xs = 2i(n4 —n_). (11.267) 


S 
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Thus v = n- — n4, which cannot be satisfied unless there are at least v zero 
modes. 

For the case v = 1 we can easily show that there is one zero mode with negative 
chirality and no zero modes of positive chirality. We note that the instanton 
configuration is self dual, Fy = F uv. We assume that there is a positive chirality 
zero mode p+ = 0 and ysyt = wt. Then 


1 1 
0= (D)? ytt = E P sl) Di Dye 
1 
= DD T 4 Wus wl Fuh? 
1 
s Deptt + qh Ww) Fa (11.268) 


However, 


1 1 1 
Fs as We) Fv 5 a wh = Pup (—Seworrterr yor 


Y 1 
= —Fuv Ww wyt = — Fur = [ys nly. 


2 
(11.269) 
Therefore, Fur [Yu yu]Y?t = 0 and consequently D?y°+ = 0. Then 
o= f Ata DY) = f EDD), 01.270) 


which is positive unless D,,x°* = 0 identically. Then, in the gauge A; = 0, this 
requires 0379+ = 0. However, this is inconsistent for a normalizable wave function 
except if ~°+ = 0. Therefore, in fact no positive chirality zero mode can exist. 
Of course the analysis fails for a negative chirality solution, we cannot conclude 
F, 
exactly one negative chirality zero mode so that v = n_ — n, is satisfied. 


vu wyt = 0 for a negative chirality zero mode, and there has to be 


Therefore, the fermionic functional integral simply makes the contribution 
from all non-zero instanton sectors vanish. Thus the theta vacua are all 
degenerate in energy, and the chiral symmetry is certainly spontaneously broken. 


11.6.5 Why is there no Goldstone Boson? 


To see the non-existence of a Goldstone boson we must modify our chiral Ward 
identities. The following matrix element no longer makes sense in the non-zero 


instanton sector as the denominator vanishes, 


_ [Dp peT SEIH (a1) a 6" (tm) | 
7 fDi, pje Seb) ; 


however, if we consider just the numerator 


UH (t1) 6" (@m)))4 = J D(p,pje EPG (ay) ++ Gm) (11.272) 


(H (x1): H (@m))4 (11.271) 
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then formally the symmetry properties are identical, and we find 


o , A 
(E -2i ) oeo" eno. (11.273) 
Now the matrix element in a theta vacuum is given by 


(016 (arr) ++" (2m)10)4 f 
_ [DA)eF2 Mei" f Dp, pe SEO) G (21) 0am) 
o f D(A)eSe(ADeiv? f D(p, pje- Seh) 
_ SPIA SED e (G1 (a1) ++ 6 (@m)))4 
7 J Dl Ale SEM eire ((1))4 


(11.274) 


and now the denominator does not vanish for [Dap pe SEO] Æ 0 for the 
sector v = 0. Thus clearly 


(F.- 25) (016% er)---6"(en)|0) =0. (11.275) 
Oa o0 

This is quite interesting. It means that the U,4(1) transformation corresponds 
equivalently to a change in 0, i.e. a U,(1) transformation changes one theta 
vacuum into another. The chiral symmetry is therefore spontaneously broken, 
and the degenerate set of vacua are exactly the theta vacua. 

In summary, we have first found the degenerate, classical vacua and their 
quantum counterparts, |n). Then instantons have the effect of breaking the 
degeneracy obtained by quantum tunnelling between the different |n) vacua, 
and the new combinations |0} = >>, e’”°|n) are the new energy eigenstates with 
spectrum 

E(0)/V = —2K cose *, (11.276) 


where So is the classical Euclidean action of one instanton. The parameter 0 
has nothing to do with chiral symmetry; indeed, there are no fermions yet. 
But once massless fermions are added to the theory, all the effects of the 
instantons disappear, due to the appearance of a fermionic zero mode. The 
|0} states suddenly become degenerate, and a chiral transformation corresponds 
exactly to a transformation of 0. The chiral symmetry is spontaneously broken 
as there exist infinitely many vacua which are transformed into each other 
by the action of a chiral transformation. There is one possible way that 
the system could escape these conclusions, if 0/0a(6|¢'(a1)---¢™(@m)|O) = 
0/00(0|b'(x1)---¢™(&m)|0) = 0, i.e. nothing depends on a or 6. This would 
mean that chiral symmetry is manifest and not spontaneously broken, and the 
vacua |0) are just copies of a single, unique vacuum state. It is easy to dispose 
of this possibility. If we calculate 


(9|o+|6), (11.277) 
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where o4 = (735) w then 
Zs 
ða ~ 
Then if (0|o+|0) #0 we have xox #0, which then requires that the symmetry 
is spontaneously broken. We will calculate (0|o4|0) next and show that it cannot 


=42ic,. (11.278) 


vanish. 

We have already done the Gaussian functional integral about the classical 
critical point, the instanton solution, up to a final integral over the scale size 

2 [®© dr 
K= F ; ye FAM), (11.279) 

where M is a renormalization point scale (represented by a mass), but now we 
should append this result with a fermionic functional integral. For a fermion 
in a background of a configuration of n well-separated instantons and anti- 
instantons, there are n fermionic zero modes to the operator iJ). Then the 
corresponding fermionic functional integral over the corresponding Grassmann 
coefficients vanishes, 


J eardace?*2" 6a) =0, (11.280) 


unless ¢ contains exactly the bilinear G,a,, for each zero mode. This requires 


2n fermionic fields. We are interested in the bilinear o+, which contains two 


fermionic fields. Hence the fermionic functional integral vanishes in all sectors 
of the gauge field except for the sector with n = 1. Indeed, we must have only 
exactly one instanton or one anti-instanton so that there is exactly one zero 
mode. We cannot have a configuration of n} instantons and n_ anti-instantons 


with n} — n- = +1, since this configuration will have n = n} +n- > 1 fermionic 
zero modes and the fermionic functional integral will vanish. 

Then in the sector with just one anti-instanton with n- =1, n+ =0 and self 
dual instanton fields, Fy, = a the fermionic functional integral will have just 
one term 


z 1— = 


= (po) y (det’i) (11.281) 


as the zero mode is a chirality —1, (4) y°- =°-. The fermionic zero modes 
satisfy iP (x) = iyu (ðu + Ay (x))v° (x) = 0. If we move the position of the 
anti-instanton, we change A (£) + Ap (£ +X), then evidently ~~ (x) + Y°- (x+ 
X) and iy, (ð, + Ap (z+ X))~° (x +X) = 0. For the case of an instanton, v = 1 
with n4 =1,n_ =0, which can in principle also contribute, we immediately get 
a vanishing contribution since (Y?+)t (x) (+32) Y+ (x) =0 as the zero mode has 
chirality +1 as (+35) w°t (x) =0. In the denominator only v = 0 can contribute, 
and the Gaussian integral is done around the configuration A, = 0. 
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We perform the fermionic functional integral first as a functional of the gauge 
fields. In the sector v = 1 we must integrate over the position of the anti-instanton 
since nothing depends on the position of the anti-instanton, which gives a factor 


of TV. For o} only the sector v = —1 contributes. The action remains the same 
and e”? = e*® for the two sectors v = +1. Hence finally we get 
© dA =87? det! (ilp) 
Blox |0)= | <e P eg EAM) AA. 11.282 
(losl = | Sge eg OM) TT (11.282) 


Dimensional analysis gives 


det’ 
el) = \h(AM) (11.283) 
det (id) 
for some dimensionless function h(AM), thus 
° dA =F yio 8 
(0lo+|0) = ; yae 2 eg f(AM)h(AM) #0. (11.284) 


This amplitude also satisfies the chiral Ward identity 


2 G10418) = (0t 2104 |8) = -22 #0 [ALE eF -8 FAM)AAM) 
a +10) = T4 0° n ee e™™ g 
(11.285) 
o o 
— +2— | (6|o+|0) =0. 11.2 
(7. +255) (lesl6) =0 (11.286) 
Thus the symmetry transformation, corresponding to a change in a, 
o 
— (O\o+|0) #0 (11.287) 
Oa 


and the symmetry is spontaneously broken. But there is no Goldstone boson. 
Such a boson must be a pseudoscalar and should give a pole at p? =0 in any 
matrix element (now continued back to Minkowski space) such as 


(B|o4(x)o—(a)|8) = $ °(8|o+(a)|n) (nlo (0) |) 


3 
= f ya (Ole (2) GBA GB Ae (0)]0)-- 


=i dp iP, xt iP, -xt 
= f qp Oleh a4 eM GBP) (GBp]o- (0)10) 


a i miret" (Olo (0)|GB p) (GB plo- (0)10) --- 


= J EB cite" P)O (OIGB PGB Fo-(0))0)-~ 
(11.288) 
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where po = |p| and |GB p} is a state with one Goldstone boson of momentum p. 
Then using (note p“p,, = p°) 


sippy) = 23m ( 2 ) (11.289) 


we get 


(Olo (2)o-(2)10) = Im J EE eim ( 1 ) 


p? +e 
(0o (0)|GB p) (GB plo- (0)10)--- (11.290) 


and the singularity appears because of the existence of a massless particle. 
However, in the calculation, there is only a contribution from the sector 


n4 +n- =2, which can be from v = 0,v = +2. Because there is one 0, operator 


and one o_, the sectors with v = +2 corresponding to two instantons or to two 
anti-instantons simply vanish. Only the sector v = 0 remains. Here there are 
two possible contributions, one is the normal, perturbative contribution without 
any instantons. It is straightforward to verify that there is no massless pole in 
the perturbative calculation. The only non-trivial configurations come in the 
sector v = 0 that correspond to a well-separated pair of one instanton and one 
anti-instanton. This contribution will simply be a constant 


(lo (2)o—(0)|8) = (Alo4 (@)|8) (Alo— (0)|4) 


( dr Sane Fil —-8 i 
= Le I? eFg-8 F(\M)h(AM) | (11.291) 
0 


and certainly will not contain a massless pole. Indeed, if we check any matrix 
element of a set of gauge-invariant operators, we will find no massless pole. 
However, if we consider a gauge-variant operator, for example 


Gp = henv arr (Ará F FA, AsAn (11.292) 


then matrix elements with G, inserted will contain a massless pole. This is 

because ô Gp = Euvao Tr (Fur Fo). Hence any matrix element with G, inserted 

must have no pole when contracted with p,,. This implies that the original gauge- 

variant matrix element must have a structure of the form aoe i.e. exactly a 
p? +ic 

massless pole. For example, consider 


(01G (Jo -(0)]8) = | d'pe®etp, 2- (p) (11.293) 
from Lorentz invariance. Then the divergence 


/ d*x(6|3,G,,()o_(0)|6) = Í dêz dtpe™ip?D_(p) (11.294) 


258 Quantum Chromodynamics (QCD) 


must have a pole at p? = 0 if it does not vanish. This would then require 


x_(p) = £ - (11.295) 
pr — ie 
However, f d4z0,,G,, = 3277v, thus 
[a2(010,.6,(0)o- (0)|0) = 32n?(A|a_(0)|0) #0, (11.296) 


where the contribution to the matrix element of ø— is only from the sector with 
v=1. Thus ©_(p) must have a pole at zero momentum. 
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Instantons, Supersymmetry and Morse ‘Theory 


As a final application of instanton methods, we will present an exposition of 
instantons in supersymmetric theories. The great simplification that occurs 
because of supersymmetry is the exact pairing of fermionic states with bosonic 
states, which makes the calculation of the fluctuation determinant very simple. 
The fermionic determinant exactly cancels the bosonic one. 

Morse theory and the Morse inequalities concern the critical points of a 
function defined on a compact, Riemannian manifold, and the global topological 
aspects of the manifold. It was the genius of Witten [125, 123, 124] to point 
out that there is a deep connection between Morse theory and supersymmetric 
quantum mechanics defined on a manifold. This is what we hope to recount in 
this chapter. 

We will require some familiarity with certain concepts from differential 
geometry which we will review here, but the reader should refer to more detailed 
texts [42, 60, 26] for a more complete picture. 


12.1 A Little Differential Geometry 
12.1.1 Riemannian Manifolds 


We consider a compact, n-dimensional Riemannian manifold. A manifold is a 
point set with a topology (the definition of the open sets in M) that is locally 
homeomorphic to R”. This means that each point in the manifold is contained 
in an open subset U; of the manifold which can be mapped to R” by a 
homeomorphism ¢; Ð ¢;(U;) © R”. Homeomorphism means the mapping takes 
open sets in U; to open sets in R”. The set of the U;s cover the manifold, i.e. 
U;U; = M. Any such set of Ujs is called an atlas and each individual U; provides 
a coordinate chart. If two different coordinate charts U; and U; have a non- 
empty intersection, U; N Uj # Ø, then the function ¢; o z` which maps points 
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in j(U; N Uj) > @(UiNU;), ie. defines a function from R” > R” must be 
k times differentiable. This defines a C* manifold. We will always simply take 
C™ manifolds. The (local) coordinates of each point in a given U; are just the 
coordinates of the point to which it is mapped in R”. 


12.1.2 The Tangent Space, Cotangent Space and Tensors 


The manifold has a tangent space at each point P, Tp(M), which is defined as 
the space of linear mappings of real-valued functions defined on the manifold 
to the real numbers which satisfy the Liebniz rule, v(fg) = (Uf)g + f(vg). The 
dimension of the tangent space is also n. The elements of the tangent space are 
called vectors. A basis of the tangent space can be trivially given in terms of a 
system of local coordinates. If xê are a set of coordinates at a point P of the 
manifold, then any linear mapping that satisfies the Liebniz rule, on the space 
of functions defined on the manifold at the point P can be defined by 


T: f(z) > R2af)=v'dif(x)|p. (12.1) 


Thus a vector is equivalent to a set of n components ọ = (v',v?,---,v”). If the 


components of the vector are smoothly varying functions of the coordinates vt (x), 
then we define a vector field. The cotangent space T (M) at the point P is simply 
defined as the dual vector space of the tangent space at the point P. The dual 
vector space of a given vector space is simply the space of linear mappings of the 
vector space to the real numbers, thus T (M) :Tp(M) — R. The dimensionality 
of T3(M) is also n. If we have an arbitrary basis FÆ; of T,(M), then the dual 
basis of T5(M) is defined by the condition 


(Ei t) = 6. (12.2) 

We name the dual basis to the coordinate basis 0; using the notation dx/ so that 

(0;,da7) = ð. (12.3) 

A general dual vector or “co-vector” can be written as i* = ujdx and then for 
a general vector y= v'0; we have 

(0, a@*) =v'u,;(0;, dx) = viuj? = vu. (12.4) 

If we change our system of coordinates of the coordinate chart at the point p, 

as zi, then the coordinate basis vectors of the tangent space transform simply 


as 0; = oa z ð; or equivalently = Oa O; But then the new dual basis vectors 
F 1 k R i . 
must be given by dz” = On" xt or equivalently dx’ = oe dx’? so that the inner 


product between 0; and dx is preserved, i.e. 


; Oa! ., Out Ox'® Ox) 
(ði, da?) = (Fr sari" ~ Dat Det (kr da") 
Ik j Ik Í , 
SOR Ox? q Or”! OF w (12.5) 


art Ox * ~ Əri ðr 
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This then gives the transformation properties of the covariant and contravariant 


è 7 n , i Š 
components of of vectors and co-vectors, indeed, v = v” 0; = v” oz 9, = vid; 


ðx'İ 

: 1j ne g ! TE Ij 

üx = udz" = wl. dri = u;dzt i — y'i 2e ; = ul O22 
and üx = ujdx uj x dx uidz'. Hence v Dt aor and u; = uj aT Or 

p li — „j 3x” I1 — 8x ` 
equivalently v” = v T and u; = uj au? Then the inner product between 
arbitrary vectors and co-vectors is invariant 

(0, U") =v'uj =v" ul. (12.6) 


We note the possibly confusing nomenclature: the components of vectors are 
said to transform contravariantly while the components of co-vectors are said to 
transform covariantly. 

We can also take tensor products of the tangent space k times and the 
cotangent space l times, 


Tp ®-:-@Tp @Tp®::-@Tp 
a am N——— ee” 


(12.7) 
k l 
to define tensors (and tensor fields) 
t= ti fn OOH; Ode" D- @da. (12.8) 


We should stress that at this point there is no relationship between the tangent 
spaces, the cotangent spaces and their tensor products over distinct points. The 
construction is independently done over each point. To use a leading terminology, 
there is, at the moment, no connection between tangent spaces at neighbouring 
points. The ensemble of the tangent spaces over all the points in the manifold 
defines a larger manifold called the tangent bundle, a fibre bundle over the 
manifold M. The base manifold is M and the fibre is Tp over the point P in 
M. There is also the corresponding cotangent bundle constructed with the co- 
tangent space. The complete spaces are fibre bundles, spaces that locally permit 
a decomposition into a Cartesian product of a patch of the base manifold M 
cross the fibre, which would be the tangent space in the case of the tangent 
bundle, etc. 


12.2 The de Rham Cohomology 
12.2.1 The Exterior Algebra 


The de Rham cohomology concerns the ensemble of the set of spaces of the 
completely anti-symmetric tensor products of the dual tangent space. We start 
with the cotangent space, TŽ. Any two basis elements dx and dy can form an 
anti-symmetric two-co-tensor defined as 


1 
dz \ dy = 3 (da 8 dy — dy @ dz). (12.9) 
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The product A is called the Cartan wedge product or the exterior product. Then 
an arbitrary anti-symmetric two-co-tensor is given by 


t= tijdz’ Adz). (12.10) 


This construction obviously generalizes to the notion of anti-symmetric p co- 
tensors constructed over each point x of the manifold. The set of anti-symmetric 
p co-tensors forms a sub-space of the p-fold tensor product of the co-tangent space 
which we will call A? (x). The set of A? (x)s for all the points of the manifold forms 
a fibre bundle over M. The elements of A?(a) are called differential forms, or 
more precisely p-forms. The dimensionality of A? (x) is obviously Cs) the number 
of ways of choosing p basis vectors from the total set of n basis vectors. We add 
in A°(x) = R, simply the real line, and then we have n +1 spaces of differential 
forms, A? (x) to A” (x), since for A”+!(x) or higher, it is no longer possible to 
anti-symmetrize n+ 1 or more co-vectors and these spaces are just empty. The 
space of smooth p-forms corresponds to the choice of the anti-symmetric tensor 
component fields f;,...;, (a), the corresponding tensor field being f;,...;,(a)dx"! A 
--- A dz'?, which we write as C°(A?) which is a space of dimension G It is 
obvious that C% (AP) and C®(A”7P) have the same dimensionality. The wedge 


product serves as a product on the full space of the direct sum of all possible 
anti-symmetric tensor fields 


A =A OAOA", (12.11) 


which then defines an algebra called Cartan’s exterior algebra. 


12.2.2 Exterior Derivative 


We can define the exterior derivative of a p-form, an operation d, which takes p 
forms to p+ 1 forms 


d: C>(AP) > 0%(APH) > 
dfai la)da"t Ande) = ( 2 


qi inte w) dx? Adz’! A- Adz”. 
x 


(12.12) 


Note the placement of the additional dx’ by convention to the left of all the 
other differential forms. Obviously 


ddwp =0 (12.13) 


for any p-form wp. The chain rule also simply follows, for wp a p-form and xq a 


q-form 
d(wp A Xa) = (dwp) A Xa + (—1) wp A (dXq)- (12.14) 
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As C%(A?) and C®(A”7P) have the same dimensionality, we can define a 
duality mapping between these spaces, called the Hodge « duality transformation. 
We define 


«1 C®(AP) > C™(AT-?) > 


1 iy-tp 


«(dxt A+.» Adz?) = „dat? A... A dx” 


(12.15) 


where qe ia is the completely anti-symmetric tensor in n dimensions. We have 
been careful about keeping indices up or down; however, it is important to point 
out that nothing we are doing requires the definition of a metric on the manifold. 
The exterior algebra and exterior differentiation does not depend on a metric. 
We note that 


# kwp = (—1)P-P uw, (12.16) 


12.2.3 Integration 


The space C®(A”) is one-dimensional, there is only one n-form, dr! A- +- Adz", 
thus it is easy to see that dr A---A dx’ = et "'mdr! A- A dx”. This form 
can be identified with the volume form on the manifold and we can define the 
integration over the manifold with this volume form; one simply integrates in R” 
in the charts of any given atlas, making sure not to double count the contributions 
from regions where the charts intersect. The integration is independent of the 
coordinate system, since the volume form transforms exactly by the Jacobian of 
the coordinate transformation, dx! A --- A dx” = det or dx A++- A dx". This 
integration generalizes trivially to integration over sub-manifolds of M of a given 
dimensionality p of a p-form defined over the sub-manifold. With the notion of 
integration, we can define an inner product on the space of p-forms 


(Wp, Xp) =/ Wp N*Xp: (12.17) 
M 


In terms of the coefficients, wp = Wiz --ipdT*1 A---A da"? and Xp = Xjy -jpa £I A 
--- A dx)? then 


(wp: Xp) =f Waa eee A+++ Adz"? A #OGeapdo” A+++ Adat) 
M 


= fae Carats te 1A... ip JL Ip Jptl A... Jn 
fs Oe OM ghee Adr? A a-p drt in PHLANA. Adr 


1 Bera eae . 
ates anaes, We JV Ip ta tpIpt1Indpl N... n 
a ip XG io (np)! tiin" dx ^- Adr 


=p! | Waa Xj jph oe ede A Ada”, (12.18) 
M 
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The inner product is symmetric, (wp, Xp) = (Xp, Wp). 


The next structure we will define is the adjoint of the exterior derivative, which 
we call 6. The inner product 


(wp. dxp-1) = f up Ndxy1= | (dxp—1) A *Wp 
M M 
= iia ie aaa em, 
M 
= f -Pyp AO POD sade 


= ((-1)"? 441 x d*wp) A *Xp—1 
M 


= (duppi) (12.19) 
where we have used trivial identities such as (—1)?” = 1. Thus 6 = (—1)"?*"t! x 
dx, and note for n even the sign is always —1 and 6 = — * dx, while for n odd we 


get 6 = (—1)? «dx. It is also easy to see ddw, = 0. 

The exterior algebra naturally gives rise to a Stokes theorem for manifolds 
with boundaries. If 0M is the boundary of a p-dimensional manifold M and 
Wp—1 is an arbitrary (p— 1)-form, then Stokes theorem states 


i diy 1 =| Wp—1- (12.20) 
M ƏM 


This theorem contains and generalizes all three of the usual Green, Gauss and 
Stokes theorems that are taught in an elementary course on vector calculus. 


12.2.4 The Laplacian and the Hodge Decomposition 
The Laplacian is now defined as 
V? = (d+ 8)? = dô + ôd. (12.21) 


The Laplacian does not change the degree of the form. The Laplacian is a positive 
operator as 


(wp, V’ wp) = (wp, dôwp + dwp) = (wp, wp) + (dwp, dwp) > 0, (12.22) 


assuming there are no boundaries. Therefore, V?w, = 0, and then w is called a 
harmonic p-form, if and only if both dwp = 0 (we say wp is closed) and dw, = 0 
(we say wp is co-closed). 
A p-form that can be globally written as the exterior derivative of a p— 1-form, 
i.e. 
Wp = dCp—1 (12.23) 
is called an exact p-form while if w can be globally written as 


wp = bE pti (12.24) 
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then it is called a co-exact p-form. The Hodge theorem states that on a compact 
manifold without boundary any p-form w, can be uniquely decomposed as the 
sum of an exact form, a co-exact form and a harmonic form 


Wp = dCp—1 + Ep+1 + Pp, (12.25) 


where pp is a harmonic form, meaning that V? pp = 0. 


12.2.5 Homology 


The homology of a manifold is the set of equivalence classes of sub-manifolds 
called cycles, boundaryless collections of sub-manifolds of dimension p, which 
differ only by boundaries. We start with our initial n-dimensional manifold M. 
Then we define a p-chain as a formal sum of p-dimensional, smooth, oriented, sub- 
manifolds, M? , the formal, finite sum being written as ap = >>, ciN? , where c; are 
real, complex or integer, or even in the group Z2, giving rise to the corresponding 
p-chain. We continue to use the symbol 0 as the operator that corresponds to 
taking the boundary, day = >,cj0N?, which is evidently a (p — 1)-chain. Let 
Zp be the set of boundaryless p-chains, which are called p-cycles. This means 
Qp E Zp > ap = Ø. Let Xp be the set of p-chains that are boundaries, i.e. 
Qp E Xp > Qp = Oay+1. Since the boundary of a boundary is always empty, 
Xp C Zp. Then the simplicial homology of M is defined as the set of equivalence 
classes Hp 

Hy = Zp/Xp, (12.26) 


i.e. the set of p-cycles that only differ from each other by boundaries are 
considered equivalent, ap ~ a), > Qp = a), + OMp41. Hp is obviously a group under 
the formal addition. The formal sum of two p-cycles commutes with the process 
of making equivalence classes with respect to p-cycles which are boundaries. The 
integral homology groups are the most fundamental, we can get the real, complex 
or Z2 homologies from them. We will write the homology groups as H,(M,G), 
where G =C, R,Z,Z2. H,(M,G) =Ø for p>n. 

Ho(M,G)=G if M is connected, since 0-cycles are just collections of points, 
the boundary of a point is empty. We can reduce any finite collection of points 
with arbitrary coefficients to a 0-cycle consisting of single point, P € M. Any 0- 
cycle, œo = >>; ¢ P;, can be reduced to single point P with a coefficient $., c; € G, 
using 


a= oP =) (GPP +P) ~ (= a) P (12.27) 


as every pair of points with alternating coefficient, as appears above c;P; — c;P, 
is the boundary of a 1-cycle corresponding to any curve joining the two points. 
However, cP is not equivalent to ČP for c#čE€ G, thus the elements of Ho(M,G) 
are in a one-to-one correspondence with G. Obviously H,,(M,G) =G also, since 
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there is only one sub-manifold of dimension n, M itself, in M, and we see that 
Ho(M,G) = H,,(M,G). This generalizes to what is called Poincaré duality, when 
G=R,C or Zo (all fields) we have H,(M,G) = Hn-p(M, G). 

Finally, for G = R,C or Zə the homology group H,(M,G)) is clearly a vector 
space over G. We define the cohomology group H?(M,G) simply as the dual 
vector space to H,(M,G). 


12.2.6 De Rham Cohomology 


We define the de Rham cohomology group with respect to differential forms for 
G = R,C. With the definitions Z? as the set of closed p-forms and X” as the set 
of exact p-forms, the de Rham cohomolgy group is defined as 


H®,.(M,G) = Z?/X?, (12.28) 


i.e. the equivalence classes of closed modulo exact p-forms, w, ~ wp => Wp = 
wp + dap_1. For the special case of H’ pR(M,G) we define this as the space of 
constant functions, as their exterior derivative vanishes. A zero-form cannot be 
the exterior derivative of any “—1” form, as these do not exist. The spectacular 
conclusion of the de Rham theorem asserts that these cohomology groups are in 
fact identical to the simplicial cohomology groups and hence dual to the simplicial 
homology groups. 

We define the inner product of a p-cycle ap € Zp with a closed p-form wp € ZP 
through the integral 

It ( Gig, ; Wp) =| Wp- (12.29) 
xp 

It is easy to see that this inner product only depends on the equivalence class of 
Qp and of wp. Indeed, 


f (wp + dpa) = f unt f xia f wt f gais] Wp, (12.30) 
Qp Qp Qp Qp dap Qp 


and 


f up= f up | w= f upt f d= | Wp (12.31) 
ap+0bp+1 ap OBp+1 ap Bp+1 ap 


as Oa, = Ø and dwp = 0. Thus 7 gives a mapping 
r: Hyp(M,G) 8 Hip(M,G) >G. (12.32) 


De Rham proved the following theorems. Let {c;},i = 1,--- ,dim(Hp(M, R), 
be a set of independent p-cycles that form a basis of H,(M,R). Then 
1. For any given set of periods v;,i =1,--- ,dim(Hp(M, R) there exists a closed 
p-form w, such that 


Vi = T (Ci, Wp) z| Wp. (12.33) 


4 
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2. If all the periods vanish for a give p-form wy, then wp is exact, i.e. Wp = dX p_1. 


This means that if {w;} is a basis of p-forms of H?p(M, R) then the period 
matrix Tij = 7(c;,w;) is invertible. This is equivalent to saying that H},(M, R) 
is dual to H,(M, R). Consequently, the de Rham cohomology and the simplicial 
cohomology are naturally isomorphic and can be identified. 

The Hodge theorem asserts that for each de Rham cohomology class there is 
an essentially unique harmonic form that can be taken as the representative of 
the class. Indeed, we have from the Hodge decomposition 


Wp = dCp—1 + ÔEp+1 + Pp- (12.34) 


Then evidently, the exact form d¢,_, is irrelevant in determining the equivalence 
class. wp being closed and pp being harmonic, thus dw, = dp, = 0 which implies 
that ddé,41 = 0, but then 0 = (€)41,d6&)41) = (dp41, 0€p41) requires 6&)41 = 0. 
Thus wp = d¢p_1 + pp and the de Rham cohomology class of wp is determined 
by the unique harmonic form p, in its Hodge decomposition. This fact will be 
very important in the supersymmetric quantum mechanics that we will analyse 
in the later sections. 

We define the Betti numbers as the dimension of the homology groups and 
consequently also the cohomology groups 


B, = dim(H)(M, R)) =dim(H?,,(M, R)) =dim(H?(M,R)), (12.35) 


where B, is the pth Betti number. The alternating sum of the Betti number is 


the Euler characteristic 
n 


x(M) = 51B, (12.36) 
p=0 
and we will see it is a topological invariant of the manifold. Morse theory relates 
the critical points of functions defined on a manifold to its Betti numbers. 


12.3 Supersymmetric Quantum Mechanics 


After this brief, condensed exposition of manifolds, structures defined on 
them and of the de Rham cohomology we can now move on to show how 
supersymmetry and instantons can be used to prove the global topological results 
framed in the Morse inequalities [62]. 


12.3.1 The Supersymmetry Algebra 


In any quantum theory we can separate the Hilbert space H into H=H* 9 H7, 
where Ht and H- are the subspaces of bosonic and fermionic states, respectively. 
A supersymmetry corresponds to a transformation generated by conserved 
hermitean operators Q;,i=1,---,N that maps H* to H~ and vice versa. We also 


268 Instantons, Supersymmetry and Morse Theory 


define the operator (—1)", where F is the fermion number. Then (—1)"|w) = |Y) 
for |) € H+ while (—1)” |Y) =—|2) for |) € H7. The supersymmetry generators 
must anti-commute with (—1)", {Q;,(—1)"}=0, which means that they are 
fermionic operators. On the other hand, they commute with the Hamiltonian 
H, [Q;,H] = 0, which means that they are conserved. Finally, to define a 
supersymmetric theory we also impose Q? = H for any i and {Q;.Q;}=0 for 
i Æ j, together giving 
{Qi-Q5} = 28H. (12.37) 
This definition of supersymmetry does not allow for Lorentz-invariant theories. 
This is because Lorentz transformations combine the Hamiltonian to the 
momentum generators. In 1+ 1 dimensions we have only one momentum 
generator, P. The simplest algebra preserving Lorentz symmetry requires two 
supersymmetry operators, Qı and Q2 and the algebra 


QÌ=H+P, QR=H-P, {Q1,Q2}=0. (12.38) 


This is compatible with the idea that (H, P) transform as a vector and (Q1,Q2) 
transform as a spinor. There is just one generator of Lorentz transformation M, 
taken hermitean, and 


: .1 1 
[M,H]=iP, [M,P]=iH, [M,Qi]=i5Q1, [M,Q:]=-i5Q2. (12.39) 
Then, for example, [M, H + P] = i(H + P), which is compatible with 


[M,Q}] = [M,Q1]Q1 +Qı[M,Q1] =1}Q1Q1 +Q1i}Q1 = iQ} = iH + P). 
(12.40) 
From Equation (12.38) we easily find 


H= (+Q) (12.41) 


and therefore the Hamiltonian is positive semi-definite. Also, [H,(—1)"] = 
[P,(—1)*] =0 as they are quadratic in the supercharges, hence the Hamiltonian 
and the momentum generator are bosonic operators. 
If there exists a single state |0) in the Hilbert space that is annihilated by the 
supercharges 
Q,\0)=0 i=1,2, (12.42) 


then the supersymmetry is unbroken. Such a state obviously has zero energy 
and, since the Hamiltonian is positive semi-definite, |0) is the vaccum state. 
If there are many solutions to Equation (12.42), then the supersymmetry is 
also unbroken, but presumably the Hilbert space separates into superselection 
sectors of states constructed over each vacuum. If there are no states that 
satisfy Equation (12.42), then the supersymmetry is spontaneously broken. 
It is generally quite difficult to directly prove the existence of solutions to 
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Equation (12.42), or the lack thereof. However, the following indirect method 
sheds light on the question: one computes the index of one of the supersymmetry 
generators. 

We are looking for states that are annihilated by both supersymmetry 
generators, Q;|0) =0. Then with the algebra (12.38) it is easy to see that P|0) =0, 
thus we can restrict to the subspace Hp—p, which is all states annihilated by 
P. This subspace also splits into a bosonic and a fermionic subspace, Hp—p = 
Hic © Hpo. Within this subspace, Q? = Q3 =H, restricted to Hp=o, we can 
look for states that are annihilated by one of the supercharges, call it Q, where Q 
could be Q1 or Qə or a linear combination of the two. Q necessarily can only take 
a State in Hbo — Hpo and a state in Hpo > Hisp: Q has no other action. 
This fact then allows for the decomposition Q = Q,+Q_, where Q4 acts only 
on and maps Heeg — Hp_, while Q- acts only on and maps Hp_, > Hbo: 
Q- is the adjoint of Q+. The index of Q restricted to Hp po is then defined as 


index(Q) = dim(Ker(Q4)) — dim(Ker(Q_)), (12.43) 


where Ker(Q+) is the subspace of H5_, that is annihilated by Q+. If the index 
is non-zero then we know for sure that there are states that are annihilated by 
Q and hence supersymmetry is unbroken. The index(Q) can be written as 


indea(Q) = Tr(—1)* =ng(E =0)—nr(E=0) (12.44) 
as the bosonic zero modes in Hbo count as +1 for each mode and the fermionic 
zero modes in Hp_, count as —1 for each mode. The non-zero energy modes 
are necessarily paired because of the supersymmetry, and hence cancel pairwise 
in their contribution to the trace. The index being non-zero requires necessarily 
that there exists at least one zero energy state and hence we can conclude that in 
this case the supersymmetry is unbroken. In the sequel we will drop the subscript 
P=0 and take as given that we are working in the subspace with P =0. 


12.3.2 Supersymmetric Cohomology 


The Hamiltonian is given by 


H=QQ'+Q'Q (12.45) 
with the superalgebra 
Or =QC=0; (12.46) 
with consequently 
[H,Q] =[H,Q"] =0. (12.47) 


There also exists the operator (—1)” and usually the fermion number operator 
F, which both commute with the Hamiltonian. The states in the Hilbert spaces 
are graded by the eigenvalue of (—1)". Bosonic states, a subspace denoted by 
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H”, take eigenvalue +1 while fermionic states, a subspace denoted by H7, take 
the eigenvalue —1. The fermion number operator is integer-valued, with bosonic 
states having an even number of fermions and fermionic states having an odd 
number. The Hamiltonian maps bosonic states to bosonic states and fermionic 
states to fermionic states, while the supercharges switch the two, mapping 
bosonic states to fermionic states and fermionic states to bosonic states. 


H:Ht>Ht, H >H 
Q,Q':H* >H, H oH". (12.48) 


If we write the energy levels in an ordered list Eo < Ey <--- then the Hamiltonian 
preserves the energy-level subspace and the Hilbert space can be decomposed in 
terms of subspaces H, of fixed energy levels 


H = Hm (12.49) 


with the action of the Hamiltonian, the supercharges and (—1)”’ satisfying 
Hl, = Em, Q,Q',(-1)* Hm + Hy. (12.50) 


The energy-level subspace further decomposes into bosonic and fermionic 
subspaces Hm = H}, @H;, and while the Hamiltonian preserves the bosonic 
and fermionic subspaces (they are indeed eigensubspaces of the Hamiltonian) 
the supercharges exchange the two 


Q,Q': H}, > Hn, Hp > Ht. (12.51) 


The action of the operator Q twice, vanishes, Q? = 0. Thus we have the exact 
sequence: 


(12.52) 


An exact sequence means that the image of a given map in the sequence is the 
kernel of the subsequent map. This is called a Z2-graded complex of vector spaces 
as the fermionic and bosonic Hilbert spaces are graded with the Zə charge with 
respect to the operator (—1)”. This gives rise to the cohomology groups: 


H*(Q) = Kernel {Q: Ht —> H } /Image {Q : H7 + Ht} 
H- (Q) = Kernel {Q :H >H*}/Image{Q:H' +H}. (12.53) 
We can further refine this complex by noting that at energy level Em £0, the 
action of Q does not take you out of the energy sector, since Q commutes with 


the Hamiltonian. If a vector |Em) is Q closed, Q|Em) = 0, i.e. in the kernel of 
Q, then it is necessarily exact, i.e. in the image of the previous map, since 


|Em) =H|Em)/Em = (QQt + QQ) |Em)/Em =Q(Qt|Em)/Em). (12-54) 
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Hence all states that are closed are also exact for all the non-zero energy levels, 
and thus cohomology groups are just determined by the states in the zero energy 
sector. For a state of zero energy | Eo) we have 


0 = (Eo|H| Eo) = (Eol (QQt +. Q'Q) | Bo) = [Qt Eo) |” +/Q|Bo)[?, (12.55) 


which is only possible if both Q|Eo) = 0 and Q'|E 9) = 0. Thus the zero energy 
states are annihilated by Q and hence closed. But none of them are exact, | Eo) 4 
Q\a), since, if this were true, Qt|Eo) = 0 = Q'Q|a), which implies (a|Q'Q|a) = 
|Q|a)|° = 0, which is only possible if Q|a) = 0. Thus the cohomology groups can 
be identified with the set of zero energy states: 


H+ (Q) = Hy 
H- (Q) = H5 (12.56) 


where Họ are the states of zero energy. 

Q takes states of p fermions to states of p+ 1 fermions. It is reasonable to 
assign vanishing fermion number to states without fermions, and the action of 
Q an even number of times always gives back a bosonic subspace, while an odd 
number of times give us a fermionic subspace, hence with the notation that H? 
is the subspace of states of p fermions, we have: 


Ht = Yp éven HF. 
H = Dp oaa HP. (12.57) 


Then the Zə-graded exact sequence in Equation (12.52) becomes a Z-graded 
exact sequence 


Sej se Se SO 


(12.58) 


and we can define the cohomology group at each p: 
H?(Q) = Kernel {Q : H? + H?*"} /Image {Q : HP! + HP}. (12.59) 
The Witten index then becomes the “Euler” characteristic of the complex 
Tr(-1)"= X` (-1)?dim(H?(Q)). (12.60) 
p=0,1,--- 
12.3.8 1-d Supersymmetric Quantum Mechanics 
Consider the action 
f 1 }, . K 
s= [an= fae (53% 5 Wey? +5 (wh- iu) -wete 
(12.61) 
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The variables ù and yt are anti-commuting variables which will eventually be 
realized by the exterior derivative or some deformation of the exterior derivative. 
For the moment we just impose 


{y yt} =0. (12.62) 
The supersymmetric transformation is 
r= ept = ely 
ôy = e (iż +h’ (x)) 
ôy? = el (~it + h (x)), (12.63) 


where € = €; + 7€2 is a complex fermionic parameter. It is reasonably easy to 
see that the action is invariant under the supersymmetry transformation. The 
conserved supercharges can be obtained by Noether’s theorem 


Q= yl (iż +h'(x)) 
Qt =p (-i#+h'(x)). (12.64) 


Quantizing the system corresponds to imposing the canonical commutation and 
anti-commutation relations 


[zp] =i 
{y yi} =1 (12.65) 
as the canonically conjugate momenta are p = 0L/ðt and ty = OL/ Ow = int, 
with {7,7} =i. (The convention taken with Grassmann derivatives is action 


from the left, 0y1%2/0w, = —we, in the final analysis, it is just the algebra of 
the operators that counts.) The Hamiltonian is given by 


1 1 1 
H= sp? +5 (k (2)? + 5h" (2) (Yth oy). (12.66) 
The fermion number operator is F = yt% and satisfies the commutation relations 


As {w,v} =0= {opt ptt the fermionic fields satisfy the algebra of fermionic 
annihilation and creation operators and, if there exists the state |0) that. is 
annihilated by w, which we assume 7|0) = 0, then the state %t|0} is the only 
other independent state in the theory. Evidently Yyt|0} = |0) and #tqt|0) = 0. 
Thus we can write the fermionic operators as 


0 1 0 0 
»=( 0 a ae a (12.68) 


The full Hilbert space of the theory will be the Hilbert space of the bosonic 
variable x, which is the space of complex-valued square-integrable functions of 
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the variable x denoted by L?(R,C), multiplying (tensored with the states |0) 
and the state ~'|0)), 
H = L?(R,C)|0) $ L?(R,C)v"|0) (12.69) 


the first component is identified with the bosonic subspace and the second 
with the fermionic subspace. The supercharges remain form-invariant from their 
classical expressions 


Q = 4' (ip + h'(x)) 
Qt =4(—ip+h'(x)), (12.70) 


and commute with the Hamiltonian. We can compute, with a little straightfor- 
ward algebra, that indeed 


{Q,Q"} = 2H, (12.71) 
hence the supersymmetry algebra is satisfied. 
The supersymmetric ground states are determined by the two conditions: 


Q|Eo) = ( PTAA A ) cl aa 


Q! Eo) = ( a |E) =0. (12.72) 
Expanding |Eo) = €,(x)|0) + €2(x)~t|0) gives 
(+H) le) =0 
(-z + w) €2(x) =0, (12.73) 
which are trivially solved as 
& (£) = ce") 
f(x) = ce"), (12.74) 


Obviously these solutions cannot both be square-integrable and the square- 
integrability depends on the behaviour of h(x) as z — +oo. The four cases 


are limz+to0h(x) = +00, lim, 4+.h(x@) = Foo, limg+4.0h(x) = +00 and 


limy-++00 h(a) = —oo, the first case being equivalent to the second. The first two 


cases yield no square-integrable solution and hence there are no supersymmetric 
ground states and Tr(—1)” =0. The latter two yield a solution with either c2 =0 
or cı = 0; in each case there is one supersymmetric ground state, bosonic if cz = 0 
yielding Tr(—1)* =1 and fermionic if cı = 0 yielding Tr(—1)" =—1. Thus we 
know exactly that in the first two cases there are no supersymmetric ground 
states, while in the latter two cases there is exactly one, which is bosonic if the 


potential rises to +00 as x — +00 and fermionic if the potential falls to —co as 


x — +00 . We underline that these are exact results. 
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12.3.3.1 Supersymmetric harmonic oscillator The example of a harmonic 


oscillator is particularly simple; here we take h(a) = Sa, Then the potential in 


2 


2 
our Hamiltonian is $(h’/(x))? = 4-2? while the coefficient of the fermionic term 


is h(x) =w. Thus the Hamiltonian is given by 
l > : w? 2 
H= 5P tS x 


The harmonic oscillator has spectrum 


+50 (ptp wy). (12.75) 


1 
e= (r+3)o n=0,1,2,--- (12.76) 


for eigenstate ¢,(a), which are known to be Hermite polynomials multiplied by 
a Gaussian. The fermionic part yields the matrix 


wf —1 0 
TER] F 


which commutes with the harmonic oscillator and has the spectrum € = (- z5 g). 
Thus the spectrum of the Hamiltonian is for w > 0, 


_ jJ nw for ¢n(x)|0) B = 
cam (n+1)w for ¢n(x)w"|0) n=0,1,2, (12.78) 


and for w <0 


_ J (n+1)|w| for ¢,(x)|0) 7 T 
i -{ njw ro E Oa (12.79) 


We notice that for positive w we have a bosonic zero mode but for negative w 
the supersymmetric zero mode is fermionic. 


12.3.4 A Useful Deformation 


We will next consider a deformation where the supersymmetric harmonic 
oscillator corresponds to the lowest-level approximation. Consider the theory 
with h(x) replaced with th(x), where t is just a parameter (in no sense the 


time). 
h(x) > th(x). (12.80) 
Then the Hamiltonian becomes 
1 t? t 
He = sp +g (h(a) + 5h" (a) (Wid — py) (12.81) 


and we are interested in what happens as t > oo. In this limit, the potential 
2 

E (h'(£))? becomes very large for most values of x, and the wave function is 
pushed into regions where the potential is small. The potential is small only at 
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the places x; where h’(x;) = 0, i.e. critical points of the function h(x). Around 
critical points, the potential can be approximated in the lowest approximation 
as a quadratic polynomial ~ (x — x;)?, which brings us back to the harmonic 
oscillator that we have just analysed. The frequency of the harmonic oscillator 
becomes tw, where w = h” (x;) at the critical point, and then the energy levels 
are linear in t. The fermionic term now has coefficient th’ (a;) = tw, and thus 
also gives a linear contribution in t to the energy, which exactly cancels the 
oscillator ground-state zero-point energy for the bosonic case if w > 0 and for 
the fermionic case if w <0, just as we have seen explicitly above for the exact 
harmonic oscillator. 

Thus we are left with exactly one energy level at each critical point whose 
energy does not scale linearly with t. The energy of the state is zero in the 
approximation that we have employed. It may well be exactly zero, but this is 
not yet determined. However, we do know that without any approximations there 
is only one or no exact supersymmetric ground state in the theory, depending on 
the asymptotic behaviour of h(a). Thus all or all but one of the zero-energy levels 
that we have found approximately must in fact have non-zero energy. What will 
be clear is that the exact energy levels of the corresponding exact eigenstates, 
which are concentrated about the critical points of h(x) (as we have found to 
be approximately the case), will not scale linearly with t. To first order in the 
approximation, they are zero-energy modes. Perturbatively, they will actually 
remain zero-energy modes to all orders. Their energy can only become non- 
zero through non-perturbative corrections. These non-perturbative corrections 
are just instanton corrections, corresponding to tunnelling transitions between 
the perturbative zero-energy modes. 

Expanding the function h(x) about a critical point x; where h’(x;) = 0, 
and assuming h’’(x;) #0, which simply means that the critical points are 
non-degenerate, we have 

h(a) = hai) + Sh! (e) (0-1)? + Eh" (ai) (a r) (12.82) 
and evidently 


1 
h' (x) = h” (ay) (a — zi) + gt e — 24)? ++, (12.83) 
Scaling £ — x; > @— č; = (x —2;)/Vt and correspondingly p > p = Vtp gives 


h(x) = h(a) + DENG —&,)? + — h" (xi) (ž— %;)? +0 (=) (12.84) 


2t 


and for the Hamiltonian 
o EEn al 
Het (3P +E E—H)? + 5H (es) (WY vv) ) 


+00) +o(1) +0( 4) ees, (12.85) 
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Thus we can imagine computing perturbatively in 1/\/t where the leading term 


is given by 
Hiocal =f (5 + ; (h!"(xi))° (& = Bi)? + sh (xi) (pty = vv) 
= SP +t Me) +t h e) (Wy w) (12.86) 


where in the last equality we have put back #— x = Vt% after shifting so that 
the critical point occurs at x =0. Obviously this is the supersymmetric harmonic 
oscillator that we have just treated and completely understand. There will be, 
in this approximation, one bosonic supersymmetric ground state of zero energy, 
as in Equation (12.78), for each critical point with h”(x;) > 0 and one fermionic 
supersymmetric ground state of zero energy, as in Equation (12.79), for each 
critical point with h” (x;) <0. The eigenstate, say if bosonic, will be of the form 
(unnormalized) 


A 2 
|Eo) œ eT 3 CD @-2)* IQ) (12.87) 


which is the first approximation to the exact zero-energy state (unnormalized) 
which in this case is 
|Eo) =e?) |Q) (12.88) 


but with h(x) expanded about x; with the constant value of h(x;) absorbed into 
the normalization. Evidently, if we compute the perturbative corrections to the 
energy state in Equation (12.87), using the perturbatively (in 1/vt) expanded 
Hamiltonian (12.85), we will simply rebuild the exact zero-energy state given in 
Equation (12.88) from a Taylor expansion of h(x). However, at each stage of the 
perturbative calculation the wave function will be concentrated around z= 2;, a 
Gaussian multiplied by polynomial corrections corresponding to the higher levels 
of the harmonic oscillator. The energy admits an expansion in even powers of 
1/vt since the contribution from odd powers vanishes due to parity. However, 
the energy must actually remain zero at all stages of the perturbation, since we 
know that the energy of the exact wave function is exactly zero. Perturbative 
contributions at a higher order cannot correct a non-zero contribution to the 
energy at a lower order, hence the correction to the energy must be absent at 
each order. We can do this calculation around each critical point and, hence, 
perturbatively we will construct as many zero-energy modes as there are critical 
points. 

Since we know that in fact there is at most only one exact zero-energy mode, 
all but one combination of these perturbatively found zero modes must be non- 
perturbatively corrected to finite energy. The Witten index will be given as 


N 
Tr(-1)" = 5 sign(h” (x;)) (12.89) 


i=1 
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as each bosonic zero mode for h” (x;) > 0 contributes +1 and each fermionic zero 


mode for h” (x;) <0 contributes —1. Evidently, this sum must equal +1 or 0, as we 
have found, dependent on the asymptotic behaviour of h(a). This makes perfect 
sense as the number of concave and convex critical points can only change in equal 
numbers if we deform h(x) locally, as long as the asymptotic behaviour of h(x) 
is kept invariant. As we have said, these perturbatively found zero modes must 
not be exact zero modes, thus they must lift away from zero energy due to non- 
perturbative corrections. But then supersymmetry imposes that for each bosonic 
mode lifting away from zero energy there must be a corresponding fermionic 
one that is exactly degenerate in (non-zero) energy. Thus the non-perturbative 
corrections must simultaneously lift the bosonic and fermionic perturbatively 
found zero modes away from zero energy in pairs. 

The generalization of this theory to n dimensions and on a Riemannian 
manifold will bring us to Morse theory in the next section. 


12.4 Morse Theory 


There is a connection between the Betti numbers and critical points of real-valued 
functions defined on a manifold [94, 10, 89, 125, 62, 19, 64]. We do not consider 
arbitrary real-valued functions, but an essentially generic class of functions that 
are called Morse functions. Morse functions, for which we will use the notation 
h(a), are defined to be those real-valued functions that have a finite number 
of non-degenerate, isolated critical points. A critical point is where the first 
derivative of the function vanishes, which evidently is independent of the system 
of coordinates that are used. Thus the critical points of a Morse function occur 
at a finite number of discrete points, P,, and the condition that they be non- 
degenerate means that the determinant of the matrix of second derivatives in 
any system of local coordinates containing P,, the so-called Hessian matrix, has 
a non-zero determinant. This means the eigenvalues of the Hessian are non- 
zero. We can diagonalize the Hessian, a real symmetric matrix, by an orthogonal 
transformation of the coordinates, and shift the coordinates so that the critical 
point occurs at the origin of the coordinates. We can also rescale the resulting 
coordinates so that the positive eigenvalues are +1 and the negative eigenvalues 
are —1. Then around a critical point P, with p negative directions, there exists 
a coordinate system in which a Morse function appears as 


p n 
h(x) =ca -Y r? + `y r, (12.90) 
i=1 i=p+1 
where Ca is the value of the Morse function at the critical point. This rather 
reasonable fact corresponds to the Morse Lemma. It is clear that with an 
infinitesimal deformation of the Morse function, all values of the function ca 
at the critical points can be taken to be distinct. Furthermore, we can assume 
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that the values of the Morse function at the critical points are labelled in a 
monotone, ascending order, cy < ¢j41, /=0,--:-,N. p is called the Morse index of 
a critical point, and the number of critical points with Morse index p is called 
the Mp- 

A surprising fact corresponds to the understanding that the manifold can be 
reconstructed out of any Morse function that is defined on it. One considers the 
inverse map defining the submanifold (not including its boundary) 


M.={xE M5 h(x) <c}. (12.91) 


Clearly for c < co, where co is the global minimum of the Morse function, Me<co = 
@. The global minimum must exist as the manifold is assumed to be compact. As 
we increase c, when we pass co, but stay below the next critical point where the 
value of the Morse function is c1, the manifold Me; >c>cg is topologically always 
the same and what is called a 0-cell. The nomenclature, 0-cell, corresponds to the 
fact that the critical point which is the global minimum has 0 negative directions. 
A 0-cell is in fact topologically an n-dimensional ball, without its boundary. It 
is evident that the topology of M. -, does not change as we increase c, until 
we come to the value of c1, the next critical point of the Morse function. At the 
critical point c1, there are p negative directions and n—p positive directions. The 
manifold must attach a p-cell to the 0-cell that rises from the global minimum 
and the topology of the manifold must change as c passes from below cı to above 
cı by the attachment of a p-cell. A p-cell corresponds to a topological manifold 
that has p negative directions and n — p positive directions, such a manifold is 
sometimes called a p-handle. 

This construction will continue at each critical point of the Morse function. 
The topology of the set Me will be invariant for cı < c < ci41, the topology 
change occurring exactly and only at the critical points of the Morse function 
with values c;. At each critical point of the Morse index p we will have to attach a 
p-cell. Finally, for c > cy, where cy is the global maximum of the Morse function, 


Mesey =M (12.92) 


and at this point we will have reconstructed the entire manifold. As we approach 
the final critical point, we must attach an n-cell, as the global maximum has n 
negative directions. An n-cell is also, topologically, an n-dimensional ball, as was 
the 0-cell at the global minimum, except that it now has n negative directions. 
Nothing precludes the attachment of n-cells, 0-cells or in general any number 
of p-cells at intermediate critical points; if there are local critical points with p 
negative directions, that is what is required. Indeed, in principle, for a critical 
point of the Morse index p, we must add a p-cell. The detailed description of 
this attachment of p-cells, or p-handles as they are sometimes called, is rather 
straightforward and unremarkable. We will not describe it in any more detail. 
The reader can consult the literature cited above for the full details. 
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Obviously, the reconstruction of the manifold based on a given Morse function 
must obey some constraints imposed on it due to the actual global topology of the 
manifold. The actual global topology of the manifold cannot arbitrarily change 
by its reconstruction based on a given Morse function. The actual topology of the 
manifold specifically constrains how many p-cells exist in the manifold. Hence 
the reconstruction based on a Morse function must be in some sense redundant. 
This gives the first hint that the number of critical points with Morse index p 
must be restricted by the global topology of the manifold. 

The crudest example of such a restriction is, for example, the condition that 
there must exist only one global maximum and one global minimum for the Morse 
function. The topology of the manifold, that it is compact, imposes this condition. 
As any Morse function on the manifold can be interpreted as a height function, 
with a corresponding topology preserving deformation of the manifold, we can 
easily see that it is possible to eliminate pairwise, for example, a local maximum 
and a local minimum by simply deforming the Morse function or equivalently 
the manifold. Indeed, we will be able to show that the number of critical points, 
Mp, of the Morse index p is bounded below by exactly the topological properties 
of the manifold expressed in the Betti number B,, 


My > Bp. (12.93) 


These correspond to the weak Morse inequalities. There are also strong Morse 
inequalities, which we will introduce when appropriate in the sequel. 


12.4.1 Supersymmetry and the Exterior Algebra 


The realization of supersymmetry that we will use corresponds to the following 
identification in the exterior algebra of a Riemannian manifold, M, of dimension, 
n, where we will further assume that it is equipped with a smooth metric gij. 
Let Q =d, Qt = ô, and 


Qi=d+é Qz=i(d—-ô), H=dd+éd. (12.94) 


Then 

H=Qj=Q; and {Q:,Q2}=0, (12.95) 
i.e. the supersymmetry algebra is satisfied. p-forms are bosonic or fermionic 
depending on whether p is even (bosonic) or odd (fermionic). The Q; map bosonic 
states to fermionic states. 

What are the supersymmetric ground states for this quantum-mechanical 
theory? Evidently they are the zero modes of the Laplacian, those p-forms that 
are annihilated by the Laplacian, H = dô + dd = V?. But these are just the 
harmonic forms. The harmonic forms satisfy exactly 


Ve pp =0. (12.96) 
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The number of harmonic p-forms is exactly the dimension of the pth homology 
group, dimH,(M, R). Hence the number of supersymmetric ground states, 
dimH?(Q), of a supersymmetric quantum mechanics defined on a Riemannian 
manifold, M, is exactly equal to the Betti numbers, B,, of the manifold. 
Interestingly, supersymmetry has some relation to the global topology of the 
manifold as defined by the Betti numbers. 

The Witten index is obviously a topological invariant, the number of 
supersymmetric ground states can only change by pairs of bosonic—fermionic 
states lifting away from zero energy or coming down to zero energy. Therefore, 
we see that the Euler characteristic 


n n 


x(M) = S0(-1)?B, =X (-1)"dimH? (Q) = Tr (-1)" (12.97) 


p=0 p=0 


is in fact a topological invariant of the manifold. 


12.4.2 The Witten Deformation 


We deform the exterior algebra with an additional real parameter, t, and an 
arbitrary smooth real-valued function, h(x), defined on M, which will be the 
appropriate Morse function, and then we let 


di =e "tde 6, = etie", (12.98) 
These operators continue to satisfy d? = 0 = ô?, and so we define 
Qiu =dt+0t, Qu =i(de— 64), He = dtt + ôrdi (12.99) 
and the supersymmetry algebra is satisfied for each t 
Qi = Qi = Hen {Qu Q24} =0. (12.100) 
Then with Qi = (Qie — iQ2:)/2, deformed supercharges are given by 
Qi=d+tdh^ Qİ =8+t(dh^ V. (12.101) 


As before, the exact supersymmetric ground states are those that are exactly 
annihilated by Q, and by Q. These would be the analogue of the harmonic 
forms. In the local coordinate system these are easily determined; for example, 
for states annihilated by Q; we need to find p-forms that satisfy 


Qiwp = (d+tdh)w, =0. (12.102) 
Writing wy = wj,...;,d2"1 A+- Ada’? we get 
(OjWiy-ip + tOjhw,...ip dx Ada" A- Ada? =0. (12.103) 
This has an evident solution 


SMe es SEs Re (12.104) 
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where ¢j,...;, is constant, and similarly for Qi. However, this does not mean that 
we have actually found a harmonic form, the coordinate system is in principle 
only a patch on the manifold. To find the set of harmonic forms is, in general, a 
complicated exercise. The set of exact supersymmetric ground states does exist 
and their numbers are given by the corresponding Betti numbers. 

We define the Betti numbers, B,, analogous to the definition of the de Rham 
cohomology, as the number of linearly independent p-forms that satisfy dwp = 
0, i.e. closed with respect to d, but which cannot be written as the exterior 
derivative of a p—1-form, i.e. wp A diXp-1, ie. that are not exact with respect to 
dı. The point is that this definition of the Betti numbers is actually independent 
of the parameter t, the Betti numbers so defined must be equal to their usual 
values at t = 0. d, differs from d by conjugation with an invertible operator e”t, 


thus the mapping wp > e7” 


wp is an invertible mapping of closed but not exact 
p-forms in the sense of d, mapped to closed but not exact p-forms in the sense 
of d. The dimensions of these spaces are independent of t. 

At each point, P, of the manifold, M, choose a basis, {ax}, of the tangent 
space, Tp. We will also consider the dual basis {a**} of the cotangent space T%. 
The tangent space basis vectors and the dual space basis vectors can be thought 
of as operators on the exterior algebra, acting through what is called interior 
product for the {ap} and through the usual exterior product for the {a**}. Thus 


explicitly we have 


a*t = dz? A 


Qi = lð /ð; (12.105) 
where the interior product vy is defined as 
tv (wp) = Xp-1 2 Xp-1 (V1, wae. »Vp-1) = wp(V, Vi, e , Vp—1)- (12.106) 


This is just a fancy way of saying that we contract the vector index on the 
first index of the differential form. Thus for the present case V = 5 aj = ad, and 
Wp = Wiz ip@ A+ Acar? then ta, (Wp) = Wk,igipa? At A atp, Even more 
explicitly 


Pp 
aplat A---Aa™?) =N (Ta A. Aat Aa A---a™?, (12.107) 
l=1 


The operators {a**} are dual to the {ax}, and their action on the exterior algebra 
corresponds simply to exterior multiplication. Explicitly, the action on a given 
p-form is simply given by a**(w,) = a** A wy. These operators play the role of 
fermion creation and annihilation operators. The supercharges can be written in 
this notation as 


Q:=d+td;ha**A and Q!=6+tgd;ha;. (12.108) 
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The function h(x) can be differentiated in the coordinate system, then one can 
calculate in a straightforward, but somewhat tedious, manner, 


Hi = db + dd + tg O;hOjh + tg?" Di Djh |a", ax] , (12.109) 


where gf is the assumed Riemannian metric on the manifold M and 
D; is the covariant derivative with respect to the Levi-Civita connection 
associated to the metric, explicitly, D;D;h = D;Ojh = 0;0;h — T} ðh with 
Ti = 4g" (O:91; +3jgu — ðigij). For large t, the potential t?g‘70;h0;h dominates, 
and the wave function concentrates about the minima (critical points) of this 
potential. Corrections can be computed as an expansion in powers of 1/vt, 


exactly as in the one-dimensional case. 


12.4.3 The Weak Morse Inequalities 


h(x) will be called the Morse function, and we will assume it is non-degenerate, 
meaning that it only has isolated critical points at coordinates x°%, at which 
0;h(«*) = 0. Therefore, at each critical point the matrix of second derivatives, 
D;D;h, must be non-singular, t.e. it does not have any vanishing eigenvalues. 
We define Mp to be the number of critical points with p negative eigenvalues. 
The first Morse inequality states that Mp > Bp, which we will be able to prove 
with our supersymmetric quantum mechanical model. 

Let APA be the nth smallest eigenvalue of H; acting on p-forms. We will see 
that 


M(t) = tAM™ + o(1) + 0(1/t), (12.110) 


which admits an expansion in powers of 1/t due to parity. The Betti number, 
Bp, is equal to the number of exactly zero eigenvalues. For large t, the number 
of the eigenvalues that vanish can be no larger than the number of A” that 
vanish, simply because a vanishing eigenvalue requires A”) = 0. We will show 
that the number of A”) that vanish is equal to the number of critical points of 
the Morse function with p negative eigenvalues, which means that Mp > Bp. 

At each critical point we can use Gaussian normal coordinates x’, coordinates 
in which the metric is simply 4;; and shift the origin so that they are chosen to 
vanish at the position of the critical point. The Morse function can be expanded 
in a Taylor series; in general, this gives 


h(x) = h(0) +5 5 (5o) agi po, (12.111) 


i,j=0 

A further orthogonal rotation of the coordinates keeps the metric 6;;; however, 
the real symmetric matrix of second partial derivatives can be diagonalized, with 
eigenvalues A; = 0;0;h(0) in the new coordinates. Then we get 


h(a) =h(0) +5 Ma + (12.112) 
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and 
ihla) = gate. (12.113) 


The Hamiltonian then also admits a local expansion about each critical point, 
using the general expression Equation (12.109) and noting that the metric is 6;,, 
the Levi-Civita connection vanishes so that covariant derivatives are ordinary 


derivatives and 0;h(x') = Azt +--+ using Equation (12.112) 
= o? 2320 i2 i 
=> (L PATH + tAlt as) t. 12.114 
Hi > Irar a(x) + tàila aJ) + ( ) 


The explicitly written term, although an approximation to the full Hamiltonian, 
is sufficient to compute the AM, To compute the expansion of the eigenvalues 
in powers of 1/t requires calculating the higher-order terms in the Hamiltonian 
and continuing the perturbative expansion. 

As the operators a; and a* are also simply linear operators on the exterior 
algebra by exterior or interior multiplication, they commute with the simple 
harmonic oscillator part and hence the local Hamiltonian in lowest approximation 
can be written as two commuting terms 


Hiocal = X Hi +tA:K; (12.115) 
i=l 
with 
a 2AP (xt)? 12.116 
H; =-——— +? (ri : 
? Oxidant | i (2") ( ) 
while 
K; = [a™, aj]. (12.117) 


H; is the Hamiltonian of the simple harmonic oscillator, with the well-known 
spectrum E;(N;) = ¢t|A;|(1+2N;), where N; =0,1,2,---, taking into account that 
the A; are not necessarily positive. The corresponding eigenfunctions are Hermite 
polynomials multiplied by Gaussians centred at the origin, and hence rapidly fall 
off for |A;x*| > 1/vt. 

The eigenvalues of K; are simply +1. The action of K; on a p-form w is 


Kyw = [a*t aiw = a ajw — aja**w = 2a*' aww — {a;,a**}w = (2a*ta; — 1)w. The 
first operator is simply the fermionic Hamiltonian for one degree of freedom for 
each i, which has eigenspectrum 0 or 2, acting on the fermionic vacuum state or 


the one fermion state, which yields the eigenspectrum +1 for K;. Another way to 
see this is to realize that the action of K; on a p-form w = Way ga A+ Natr 
obviously gives back w if i € (i1 +++ ip) but gives back —w if i ¢ (i1---ip). 

As K; and H; commute, the eigenvalues simply add; thus, the spectrum of 
Hiocal is 


E(Ni,ni) =t > (\Ail(L+2Ni)+nidi), Nj =0,1,2,-++ ni = 


i=l 


1. (12.118) 
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If we restrict the action of Hjoca) to p-forms, then the sum over K; in the 
Hamiltonian contains p terms for which the eigenvalue of K; is +1; thus, the 
number of n; that equal +1 must be equal to p. The remaining K; will have 
eigenvalue —1, thus the number of these will be n — p, where n is the dimension 
of M. 

The only way it is possible for the energy F,(.N;,n;) to vanish is if all N; =0, 
ni = 1 for each negative A; and n; = —1 for each positive \;. We can solve this 
constraint if we choose the p-form to consist of the p-fold exterior product of 
coordinate differentials of exactly those coordinate directions which correspond 
to the negative eigenvalues. Thus the energy eigenvalue is (allowing for a minor 
relabelling of the independent directions in the manifold) 


Pp n 
Ei(Nan =A ADAH Y (AOA) 0 (12.119) 
i=1 i=p+1 
as n; = +1 for the first p directions with negative eigenvalues and n; = —1 for 


the n — p remaining directions for which the eigenvalues are positive. 

Thus for a critical point with Morse index equal to p, i.e. with p negative 
directions at the critical point of the Morse function h, it is possible to satisfy 
these conditions. We choose a p-form with a coefficient function given by the 
ground state of the harmonic oscillator (which puts all the N; = 0), and which 
consists of exactly those coordinate differentials which correspond to the p 
negative directions, \;, which gives the desired n; = +1. Thus at a critical 
point of Morse index p, we can construct exactly one eigenfunction which could 
have a zero eigenvalue. These are zero-energy eigenfunctions of the approximate 
Hamiltonian given in Equation (12.115). We could, in principle, compute the 
corrections that are brought to these approximate zero-energy levels, but we can 
be assured that they will remain low-lying levels even as t > œo, the key point 
being that AM) vanishes for all of these levels. The dimension of the subspace 
spanned by these levels is Mp, the number of critical points with Morse index p. 

For an actual vanishing eigenvalue of the full Hamiltonian (12.109), all 
higher perturbative and non-perturbative corrections must also vanish. This 
will happen for the exact supersymmetric ground states. The number of exact 
supersymmetric ground states is given by the Betti number, Bp, which is equal 
to the number of p-forms with zero eigenvalues of the Laplacian, dé + ôd, or 
the deformed Laplacian, d,6; + 6d. For each actual zero eigenvalue, we know 
that the A”) must also vanish, as the computation of AS is the first step of 
computing the exact eigenvalue in perturbation. We have determined that the 


number of approximate states corresponding to Aw =0 is Mp, the number of 
critical points of Morse index p. Hence the number of actual zero eigenvalue 
states must be less than or equal to M,. Thus we obtain the result 


My > Bp- (12.120) 
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These are called the weak Morse inequalities. 


12.4.4 Polynomial Morse Inequalities 


We actually wish to prove something stronger, that the Morse numbers always 
dominate the Betti numbers as encapsulated in the polynomial Morse inequality 
which states that there exists a set of non-negative integers Qp such that 


n n n-1 
XOM — 5" Bot? = (1+t) Y Qpt?. (12.121) 
p=0 p=0 p=0 


This is an inequality in the sense that Qp > 0. As the weak Morse inequalities 
give us that M, > Bp, it is clear that the coefficient of t? on the left-hand side 
is necessarily positive semi-definite. The right-hand side has the coefficient Qp + 
Qp-1 (with Qn = Q-1 =0) for t?, which then must be positive semi-definite. 

The polynomial Morse inequality is equivalent to the following two assertions, 
called the strong Morse inequalities (as originally proven by Morse): 


So(-1P M, > S0(-1)?*"B, for m=0,1,---,n (12.122) 
p=0 p=0 
S 7 (-1)? Mp = 5 (1 Bp. (12.123) 
p=0 p=0 


We can prove the equivalence as follows. If we take the second equality, 
Equation (12.123), we have 


(Seay -So0,) 
p=0 p=0 


i.e. t=—1 is a root of the polynomial }7)_, Mpt?” — X p-o Bpt? and hence it is 
divisible by 1 +t. Thus we have immediately and trivially 


=0, (12.124) 


t=-1 


n n n-1 
S > M-Y Bot? =(1+t) >_ Q. (12.125) 
p=0 p=0 p=0 


Since the coefficients are integers on the left-hand side, the Q, must also be 
integers. It remains to show that Qn > 0. To see this we analyse the identity 
power by power in t. We start with the t? term. This term gives 


Mo — Bo = Qo, (12.126) 


which the first inequality, Equation (12.122), for m = 0 requires Qo > 0. Next, 
for the t term we have 
My — By =Q1+Qo (12.127) 


or replacing for Qo from above 


Mı — Mo — (Bı — Bo) = Qı, (12.128) 
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which the first inequality, Equation (12.122), for m = 1 then requires Q > 0. 
Doing one more step, before concluding the general relation, we have for the 
coefficient of t? 

Mə- By =Q2+Q1 (12.129) 


replacing for Qı from above 
Mə — Mı + Mo — (B2 — Bi + Bo) = Q2. (12.130) 


Again from the first inequality, Equation (12.122), for m = 2 then requires Q2 > 0. 
We see then that in general 
m m 
XOP My — X_(-1)P*™B,=Qm for m=0,1,-,n—1 (12.131) 
p=0 p=0 
and hence we can conclude that Qm > 0 for all m = 0,1,2,- -n — 1. 
To prove the converse, the polynomial Morse inequality, Equation (12.121), by 
comparing powers of t, as we have just seen, implies 
n n 
So(-1)? t Mp- X_(-1) "Bp =Qm for m=0,1,---,n-1 (12.132) 
p=0 p=0 
but now we assume that the Qm > 0. Hence we recover the first inequalities in 
Equation (12.122) trivially. To recover the second equality, Equation (12.123), 
we simply put t = —1 in Equation (12.121). 

The second equality, Equation (12.123), is related to the Euler characteristic 

of the manifold. This is defined as the alternating sum of the Betti numbers 
n n 
x(M) = 5° (-1)? B, = X_(-1) Mp. (12.133) 
p=0 p=0 

From the weak Morse inequalities, we know that M, > B,. Thus the number of 
critical points of Morse index, p, could be greater than the Betti number, B,, 
but then there must be exactly the same surplus of critical points with opposite 
value of (—1)?, i.e. each additional critical point of Morse index, p, must pair 
with another critical point of Morse index of opposite parity. As p determines if 
the state is fermionic or bosonic, we identify these pairs of critical points with the 
approximate, supersymmetric, bosonic and fermionic zero energy pairs of states 
associated with each critical point, but those which must actually lift away from 
exact zero energy when non-perturbative corrections are taken into account, as 
the actual number of supersymmetric zero energy states is strictly given by the 
Betti numbers. 

It is straightforward [64] to prove the strong Morse inequalities using the simple 
ideas of supersymmetry and what we have understood about the spectrum. We 
know that the eigenvalues and corresponding eigenstates of H+, separate into 
two subsets as t —> co; those whose energies diverge linearly as t gets large and 
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a finite number whose energies do not. There are Mp states for each p whose 
energies do not diverge with t. These further split into two subsets, the first 
B, states whose energies are exactly zero and the remaining M, — B, states 
whose energies are of o(1). We will call these latter M, — Bp states the low- 
lying states. But now we recall that, since the low-lying states have non-zero 
energy, supersymmetry requires that they come in bosonic—fermionic pairs. The 
fermionic states correspond to odd p and the bosonic states correspond to even 


p, hence we must have 


X (Mp — By)= X. (Mp By). (12.134) 
p odd p even 
This immediately implies the second of the strong Morse inequalities, 
Equation (12.123) 


n n 
XA C1PM, =X (1 Bp. (12.135) 
p=0 p=0 

To obtain the first strong Morse inequality we consider the mapping that Qi: 

induces on the fermionic and bosonic subspaces of low-lying levels. As Q?, = 

H, and evidently Qi; commutes with the Hamiltonian, it must preserve the 

eigensubspaces of H;. Qır, being a fermionic operator, maps the eigensubspace 

of p-forms to the eigensubspace of p+ 1-forms and p—1 forms. Let A? denote the 
subspace of low-lying eigenstates of p-forms, clearly of dimension Mp — Bp. For 
any state |) in this subspace Q);|W) £0 and H:Quy:|v) = Q3, |v) = Qu Hilh) = 

EQiu\w), where Hily) = E|w). If Qi, maps two distinct states to the same state, 

then it must annihilate their difference, which is not possible as this does not 

preserve the eigenspace. Thus the mapping Qiu : A? + APT! @ APH! must be 
one-to-one, into (injective). Hence we can conclude 


2j-1 2j 
Q: OQ ro A 
p odd p=1 p even p=0 
2j 2j+1 
Q: A va QW (12.136) 
p even p=0 p odd p=1 


for each 7 30< 27 <n and 0<2j+1 < n. But since the mappings are injective, 
the dimension of the domain must be less than or equal to the dimension of the 
image. This yields: 


(Mı — By) +--- + (Mzj-1 — B2j—1) < (Mo — Bo) +--+ (Ma; — B25) 
(Mo — Bo) ++: + (Mo; — B25) < (Mi — Bi) +++ + (Moj41 — B2j+1). 
(12.137) 


These inequalities are identical to the first strong Morse inequality, 
Equation (12.122), if we bring the Ms and Bs to opposite sides. 
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12.4.5 Witten’s Coboundary Operator 


The polynomial Morse inequality is equivalent to the understanding that the 
critical points of a Morse function form a model for the cohomology of the 
manifold M. We define X, to be a vector space of dimension M, for each p € 
0,1,2,---,n. Xp can be thought of as a vector space spanned by the critical points 
of Morse index p. The polynomial Morse inequality, Equation (12.121), means 
that there exists a coboundary operator ôw : Xp —> Xp+ı (we add a subscript 
W to honour Witten), where 6%, = 0 and the corresponding Betti numbers, 
the dimension of the cohomology groups associated to dy, are identical to the 
Betti numbers of the manifold M. The homotopy classes in this cohomology 
are elements of Xp, which are closed under the action of dw, but differ only by 
elements which are obtained by the action of dw on some element of Xp—1, the 
analogue of the standard notion of closed modulo exact forms, or cycles, etc. The 
explicit expression for dw is not given in the original work of Morse or others; 
however, Witten found an appropriate expression for it. 

Witten proposed the following construction. First, consider possible zero 
modes of the Laplacian. The number of independent such p-forms gives the 
Betti numbers, Bp. We have an upper bound on the Betti numbers, Mp > Bp 
in the Morse inequalities. However, although perturbation theory might suggest 
a given mode is a zero mode, tunnelling effects can lift the degeneracy. Exact 


—tS where S is the action of 


instanton effects can give energies of the order of ~ e 
the instanton, which for large t is smaller than any perturbative correction. Thus 
Witten was led to consider instanton configurations that tunnel from one zero 
mode to another. In fact, tunnelling from putative zero modes which are p-forms 
to putative zero modes which are p+ 1-forms are exactly the instanton modes 
that are required. However, as we have seen, the p+ 1-form chosen at a given 
critical point of Morse index, p+ 1, requires a choice of the exterior product of 
all the coordinate differentials that correspond to the p+1 negative directions. 
The orientation or order of the differentials remains arbitrary. Thus a tunnelling 
transition from a state at a critical point of Morse index, p, to a state at a critical 
point of Morse index, p+1, must also fix a sign. We determine the sign with the 
following construction. 

Consider instanton paths, I, that pass from a critical point, B, of Morse 
index, p+ 1, to a critical point, A, of Morse index, p. The instanton path has 
initial tangent vector v within Vg, the p+ 1-dimensional vector space of negative 
directions at B. Let |b), a p+1-form, be the state of zero energy at the critical 
point B. Then |b) chooses an orientation of Vg, and we can choose an orientation 
of the p-dimensional subspace, Vp, corresponding to the orthogonal complement 
of v within |b), as we can generate a p-form from |b) by contracting it (by interior 
multiplication) with v. Then the instanton path from B to A gives a mapping 
of Vg to V4, the p-dimensional vector space of negative directions at A. This 
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mapping induces an orientation of V4. However, the state |a) corresponding to 
the perturbative zero mode at A already gave an orientation of V4. We define 


H if the induced orientation agrees with that fixed by |a) 
nr = 


—1 ifthe induced orientation disagrees with that fixed by |a) 
(12.138) 
and 


n(a,b) =X nr, (12.139) 
r 


where the sum runs over all instantons paths (paths of steepest descent) from B 
to A. Then we can define the coboundary operator, for any basis element |a) of 
Xp at A 

dwla) =X n(a,b)|b), (12.140) 

b 

where the sum runs over all basis elements of Xp+ı (in other words, this is a 
set of perturbative zero modes that are p + l-forms that are concentrated at 
the critical points of Morse index p+ 1 of the Morse function). The effect of the 
instantons is to non-perturbatively correct the energy of some of the perturbative 
zero modes, their energy behaves as ~ e™t5, for large t. Thus all states in Xp are 
not annihilated by the Laplacian dy dy, + ôiy ôw. 

Denoting Y, as the number of actual zero eigenvalues of dw df) + dj Ow acting 
on Xp, then Y, also give upper bounds on the Betti numbers, and the strong 
Morse inequality, Equation (12.121), remains valid with Mp replaced with Y,. 
Witten conjectures that, in fact, Yp = Bp. 


12.4.6 Supersymmetric Sigma Model 


To demonstrate that dy as defined in Equation (12.140) provides the 
appropriate coboundary operator, Witten considered the Lagrangian version of 
the supersymmetric quantum-mechanical model that we have been considering, 
that for which the supercharge is given explicitly by d;. Canonical quantization 
of the model defined by the action, in Minkowski time 


3 ul A dx? dzi i -; Dyt i 1 o Tink Tj nl 
fare=3 foo (F dr pT) T gPa Y wy 


. dh dh Dh -_... 
2 ij inj 
WD Adai DD Y Y ) (12.141) 


where a sum over all repeated indices is understood, gives the required algebraic 
symmetries and explicitly the supercharge. This is the Lagrangian of the 1+1- 
dimensional supersymmetric sigma model restricted to 0+1 dimensions. Here the 
w and w (the complex conjugate field to 7) are anti-commuting fermionic fields, 
xt are local coordinates, gij is the metric tensor and R,j;x is the corresponding 
Riemann curvature tensor on M, and D/Dz* is the covariant derivative with 
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the Levi-Civita connection of the metric while D/Dr is the covariant derivative 
along the direction tangent to the time trajectory. Specifically, acting on the 
fermions we have 


2y = 0 yt HI pI y. (12.142) 


Under the supersymmetry transformations 


ba’ = ey)’ — Ey’ 
Sy = e (izt — Tipp y" + tg jh) 
õp =E (—iit -Ti p Y" + tg Ojh) (12.143) 


for infinitesimal anti-commuting parameters € and €, the action is invariant, 
ô f dr£=0. The corresponding supercharges are as required 


Qt = yp (igit! +tô;h) 
Qi = Y (—igigt? +tôih). (12.144) 


There is also a symmetry-conserving fermion number, yt + eyi, ypt > ey", 
which gives the conserved charge, the fermion number 


F = gijp y. (12.145) 


In quantizing the system we will first consider the system at t = 0 (all the 
supersymmetry and other symmetries are equally valid at t = 0). We impose the 
canonical commutation and anti-commutation relations 


|x", pj] =ið; 
{y p} =g”, (12.146) 


then the conserved supercharges are simply Q = iw'p; and Q = —iy'p;. The 
supercharges have the opposite fermion number 


[F,Q] =Q, [FQ] =-9 (12.147) 


We impose that the Hamiltonian is given by the supersymmetry algebra 


{Q,Q} =2Ho (12.148) 


and consequently the fermion number is conserved, [F,Ho] = 0. The natural 
realization of this algebra is, as we have been using, provided by the exterior 
algebra of differential forms, A*(M) &® C equipped with its hermitian inner 
product from Equation (12.17) 


(w, x) = f on (12.149) 
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for two p-forms, w and x. Then the observables in this realization of the algebra 
on this Hilbert space, when acting explicitly on a p-form w, are: 


x :atw 

pi: —1Ojw 

w:dz' Aw 

Wg teas (12.150) 


(ty is the interior multiplication defined in Equation (12.106)). Then with the 
state |0) denoting the form annihilated by all of the f we have the schema: 


0)=1 
Y*|0) = da’ 
ww |0) = dx’ A dx? 
p++" |0) = dat A- Ada”. (12.151) 


The fermion number of a state that is a p-form is simply equal to p, thus 
the Hilbert space separates into bosonic and fermionic subspaces depending on 
whether p is even or odd, respectively. Thus the canonically quantized system 
reproduced with complete fidelity the supersymmetric system of the exterior 
algebra that we studied in subsection (12.4.1). 

Recall then that the supersymmetric states are just the zero-energy states, 
those annihilated by the Laplacian, the so-called harmonic forms. We underline 
that the set of harmonic forms of the manifold characterize the de Rham 
cohomology of the manifold. Equally well, the space of supersymmetric ground 
states characterize the cohomology of the Q-operator. As there is the conserved 
fermion number which satisfies |F, Q] = Q, the Q-cohomology is graded by the 
fermion number and equal to the degree p of the form. As Q is identified with the 
exterior derivative d, the graded Q-cohomology and the de Rham cohomology 
must be equal 

H” (Q) = H}gR(M). (12.152) 
The Witten index, (—1)", can be evaluated and we find 


n n 


Tr ((-1)") =< (—1)?dim(H(Q) = Y (-1)Pdim(H?gR(M) = x(M), (12.153) 
p=0 p=0 

where y(M) is the Euler characteristic of the manifold. The Witten index 

only receives contributions from the supersymmetric ground states; as we have 

seen, the non-zero energy modes are all paired in fermionic—bosonic pairs and 

their contributions cancel. Thus the calculation of the topological invariant, the 

Euler characteristic, can be done by studying the zero-energy modes of this 
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supersymmetric quantum mechanical system. Witten’s magical trick was to add 
an external field to this system, which causes a separation of the zero- and low- 
energy modes from the finite-energy modes, and in the limiting case makes the 
calculation of the zero mode sector very simple. 
Now adding in the deformation by th, the supercharges are then given by 
_ -8 
Qi =Y (igiit +tôih) = da! A (gza ttih) = d+tdh^ = e det = d; (12.154) 
g 
and 
Qi =54+t(dhr)* = 8e" = ô, (12.155) 


where x denotes the adjoint. The Hamiltonian then is as before 
1 = 1 
H = 5 {Qe Qe} = 5 (dedi + didi), (12.156) 


chosen to satisfy the supersymmetry algebra. The supersymmetric ground states 
again define the Q:-cohomology. However, since the th deformation is obtained 
by a similarity transformation 


Qi = Qet (12.157) 


the cohomology is isomorphic to the undeformed case. As the cohomology of the 
undeformed Q is isomorphic to the de Rham cohomology, we can compute the 
de Rham cohomology with the deformed operator Qr. 

The perturbative approximation to the Hamiltonian around a critical point is 
given by 


n 2 
Ht = >, (- —_ +0732 (at)? + tr a"“a) (12.158) 
with exact, zero-energy ground-state wave functions, which we will label |éw), 
corresponding to the harmonic oscillator ground state, ¢, multiplied by an 
appropriate p-form, w, where p is the Morse index of the critical point, as 
discussed previously. Indeed, perturbative corrections to the energy of these wave 
functions must vanish to all orders: the energy remains exactly zero to all orders 
in perturbation theory. One can find the modification of the wave function, order 
by order, so that its energy remains zero in each order in perturbation theory. 
This is because the corrections are calculated in terms of local data at the critical 
point. From local data it is not possible to know which critical points are actually 
necessary because of the global topology of the manifold and which critical points 
are removable by deformations. States that have zero energy to lowest order 
have zero energy to all orders. The same reasoning applies to the calculation of 
tunnelling in the double-well potential. In perturbation theory we can never get 
a non-zero tunnelling amplitude, these amplitudes are non-perturbative in the 
coupling and are not seen at any order in perturbation theory. 

However, the wave functions, |¢;w;), are not necessarily exact ground states, 
where we have added a label ¿ to denote different critical points. The perturbative 
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zero-energy states are not necessarily exact ground states. Hence the number of 
exact, supersymmetric actual ground states are clearly less than or equal to the 
number of critical points. An exact supersymmetric ground state is annihilated 
by the supercharge. For the case of the perturbative ground states, although we 
will find, if calculated perturbatively, 


Hildiwi) = (QÍ + Qi Qz)|dieri) = 0, (12.159) 
which also requires 
Qt|oiwi) = 0 (12.160) 
to all orders in perturbation theory, we can in fact have non-perturbative 
corrections 
N 
Q:|¢iwi) = yo [pjwi) (Pj j|QelPiwi) +=, (12.161) 
j=1 


where the +--- corresponds to amplitudes to non-zero-energy states (which are 
suppressed by large energy denominators as t + oo). The explicit mixing that 
can be important is between the perturbative zero-energy states. Thus we want 
to compute 


(dyusIQr lien) =f bywy Ax(d-+ tahr)o je (12.162) 


But such an amplitude is exactly what we are looking for with the coboundary 
operator dy between zero modes localized at different critical points. If w; is a 
g-form and w; is a p-form, this matrix element can only be non-zero if q=p+ 1, 
i.e. transitions between perturbative zero-energy modes correspond to critical 
points of Morse indices that differ by one negative direction. This can also be 
seen from fermion number conservation, the action of Q; on the state |d;w;) 
changes its fermion number by one unit. It also should not be surprising that the 
eventual dw that we will be able to define will satisfy 67, = 0, since it is obtained 
from the action of Qi = di. Clearly Q? = 0, hence we can expect 57, =0. 

We will return below to the notation of subsection 12.4.5 with |a) for |¢;w;) 
and (b| for (¢;w,;| and the understanding that if |a) corresponds to a p-form then 
(b| corresponds to a p+1-form. It is also clear that the action of Q; on the low- 
lying states annihilates any exact, supersymmetric ground state that is a p-form 
as these are harmonic with respect to Q+. Thus only the Mp — Bp low-lying but 
not exact supersymmetric ground states will be mixed with low-lying p+1-forms. 
But additionally, none of these states can be the exact supersymmetric ground 
states that are p+ 1-forms, since the inner product 


(b|Q:|a) = (Q:|b))" Ja) =0 (12.163) 


if |b) corresponds to an exact supersymmetric ground state, as these are also 
harmonic with respect to Q+. Thus the action of Q+ on the set of states |a) only 
mixes the Mp — Bp not exact ground states but low-lying states with the Mp1 — 
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By+1 corresponding low-lying states |b). This is as it should be; since mixing 
causes the energies to go up, this cannot happen to any exact supersymmetric 
ground state. 


12.4.7 The Instanton Calculation 


We will use the path integral to compute this amplitude, since we know that it 
is exactly through the path integral that we can uncover tunnelling amplitudes 
through the path integral, and from the amplitude we will extract the coboundary 
operator, dy. The bosonic sector of the model is governed by the Lagrangian, 
in Euclidean time 


1 dx’ dxi . dh dh 
drLy == | dr | gij Pg ele 12.164 
J Té J ata dr dr y dx? =) ( oF 


We can show that the stationary points of the corresponding action are the paths 


of steepest descent using a Bogomolny-type identity [17]. Indeed, 


1 dx’ » dh dxi „dh dh 
d = — | drq;; + tg! + tgl! tf a (12.1 
J Th al TIij ( dr I T) ( dr” =) dr dt pete) 


The first integral is positive semi-definite, while the second integral is equal to 
tAh. Therefore, if tAh > 0, we choose the plus (lower) sign, while if tAh < 0, 
we choose the minus (upper) sign. Then the second term is always positive, and 
thus 


perce > t|Ah| (12.166) 
with equality for (assuming Ah is positive) 
dz’ .. dh 

— tg” — =0. 12.167 

dt £ dxi ( ) 


This is exactly the equation of steepest descent, physically stating that the 
tangent vector to the curve is parallel to the gradient, up to reparametrization. 
Also it should be noted that this equation is not the same as the usual instanton 
equation which we have seen can be interpreted as ordinary, conservative, 
Newtonian cinematic motion of a particle in the reversed potential. Such a 
motion would never follow a path of steepest descent and stop at a lower value 
of the potential. Here the equation of steepest descent is first order in the “time” 
coordinate, and thus allows such motion. The solution to Equation (12.167) 
obviously exists, which then implies Sp = f dr£Ly =t|Ah|. Then for the operator 
d; whose matrix elements we want to compute, they will then be proportional 
to e IAI, If we want to compute matrix elements of the Hamiltonian dd* + d*d 
then we get two factors of Sz and hence the amplitude is proportional to e~74l4"!, 

The next step in the calculation is to compute the determinant of the 
fluctuations in Gaussian approximation about the instanton configuration. It is 
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in this stage that the calculation dramatically simplifies due to supersymmetry. 
The non-zero eigenvalues are all paired in bosonic and fermionic multiplets. The 
fermionic determinant is exactly cancelled by the bosonic square root of the 
determinant. The bosonic zero mode corresponding to Euclidean time translation 
invariance, which would normally give rise to a diverging factor of {, is also 
exactly cancelled by a corresponding fermionic zero mode which would normally 
give rise to a vanishing determinant. These zero modes can be explicitly obtained 
first for the bosonic case in the usual way, the bosonic zero mode corresponds 
to the Euclidean time derivative of the instanton. Then the fermionic zero mode 
is obtained by a supersymmetry transformation of the bosonic zero mode. To 
show the cancellation of the contribution of the zero modes requires some care, 
we refer the reader to the detailed calculation in [62]. Finally the amplitude is 
given by the factor 


(b|d,|a) = eA", (12.168) 


However, we still have not determined the sign of the amplitude; the functional 
integral always gives rise to an ambiguous sign due to the fermions. To determine 
the sign, we go back to the calculation of the amplitude in the usual WKB 
method of Schrödinger quantum mechanics. Here we know that the states |a) at 
the critical point A and |b) at the critical point B rapidly die off, away from their 
respective critical points. Any overlap is greatest along the paths that connect 
the two critical points that are the semi-classical solutions to the equations of 
motion, the paths that keep the Euclidean action stationary. These paths are the 
instantons, the paths of steepest descent or ascent between the critical points. 
Thus the behaviour of the states along the paths of steepest descent are enough 
to determine the sign of the matrix element (b|d;|a). The quantum mechanical 
problem becomes effectively one-dimensional along the path of steepest descent, 


and we find that the state |a} drops off as e~!" 


along the instanton that ascends 
from A to B. It must ascend, as |a) was a p-form, hence A was a critical point of 
p negative directions while |b) was a p+1-form, hence B was a critical point of 
p+1 negative directions. If we descend from A we can only reach other critical 
points with fewer negative directions, we can never reach B. 

To determine the sign, we start at |b) at B and the orientation of the space 
of negative directions at |b), which we called Vg. Calculating |b) along the path 
of steepest descent in the WKB approximation we find the wave function of the 
state |b) at A, but it is still a p+ 1-form. However, with the limiting direction 
of the tangent vector as we arrive at A, we can induce an orientation of V4, the 
space of negative directions at A. Then the sign of the matrix element. (b|d;|a) 
is +1 if this induced orientation of V4 matches that furnished by the state |a), 
otherwise it is —1. This is exactly the construction of the sign n(a,b) that was 
described in section 12.4.5, but appended with the explicit transport afforded by 
the WKB calculation of the wave function along the path of steepest descent. 
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Hence the Witten coboundary operator is given by 


Sw la) = Set) (a,b) |b). (12.169) 
b 


Since the path descends from B to A, the exponent has the right sign. This factor 
can be removed by rescaling the wave functions by 


la) > eA) Ja), (12.170) 


which corresponds to undoing the conjugation by et” which transformed d to dz. 
Hence the Witten coboundary operator is given by 


dwla) = X n(a,b)|b) (12.171) 
b 


and the notion that the set of critical points of a Morse function form a model 
of the cohomology of the manifold M is verified. 


Appendix A 
An Aside on O(4) 


O(A4) is the group defined by the multiplication properties of the set of orthogonal 
matrices which keep the quadratic form 


4 
Son (A.1) 
i=l 
invariant. If 
7 =OF# (A.2) 
then 
gf. =2-0°O.¢=2-2 
SOV = (A.3) 


where 7 is a four-dimensional vector. Looking in the neighbourhood of the 
identity, we find, with O = 1 +ô, then 
OTO = (1467) (1+8) =1+6+67 +0(6?) =1 
>6+67=0. (A.4) 


This means that ô must be an anti-symmetric, 4 x 4 matrix. This defines the Lie 
algebra of O(4). The complete set of anti-symmetric 4 x 4 matrices is given by 


(Mav)or T aobu T bur Ove 
E g SHV APE Apo: (A.5) 


It is easy to calculate 
1 
[Muv, Mor] = 4 (EuvapEApyðEorsbEorab rs Ea Bor Ear ys Ep rApEApsw ) : (A.6) 
We can expand this further; it is easy to do some of the sums over dummy 
indices, but it is more illuminating to define 


1 1 
Ji = 9 Sisk (Mj) mn = 5 Stak (d;15km Ati Ôjmôki) = Cilm (A.7) 
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and 
Ki = (Moi) im: (A.8) 


Then the commutators 


Ji, Jj] = tijk Ik 
[Ji, Kj] = eij: Kn (A.9) 


follow directly, with Jı = M23, J2 = Msı and J3 = Mj. To calculate [K;, K;] we 
consider the generators 


J, =Mi, Ja=Mæ, J3= Mai, (A.10) 


which generate the subgroup that leaves the form x2 +x? + x73 invariant. Then 
because of rotational symmetry we must have 


[33] = éijk Jp. (A.11) 

(We can check this, for example, with ESA = |Mi2, Moo] = [Mi2, M20] = 
[J3, —Ko] = —6321 Kı = Kı = Moi = J3.) Thus 

ZA = [Mæ, Mo1] = [-K2, Kı] = Jı = Min = Jz (A.12) 


thus 
[K1, K2] = J3 (A.13) 


hence rotational covariance dictates the general relation 


[K;, Kj] = éijk Jk- (A.14) 
The combinations 1 
Mi = 5 (J; + Ki) (A.15) 
satisfy the commutators 
[Mj M) ] = eij Mg (A.16) 
while 
[M}, M7] =0. (A.17) 


Appendix B 
Asymptotic Analysis 


Asymptotic analysis concerns the notion of the behaviour of functions, f(z), 
as certain parameters go to their limiting values, usually zero or infinity. For 
convenience and without loss of generality, we will consider functions as their 
arguments go to infinity. Obviously the limit to any finite value xg can be 
obtained by taking y = Goa) to infinity. 


We define 
f(z) ~ g(x) (B.1) 
if and only if 
Jim, Ln) >1. (B.2) 


The binary relation of equivalence satisfies many obvious properties: for any 
smooth function F (y), if f(x) ~ g(x) then 


F(f(x)) ~ F(g(2)). (B.3) 
This specifically is useful when applied to powers, f ~ g implies 
F ~g (B.4) 
for any real number r. If f(x)~ g(x) and a(x) ~ b(x) then 
a(x) f ~ b(x)g(2). (B.5) 


Asymptotic analysis is most useful in the application of asymptotic expansions 
of functions. An asymptotic expansion of a function f(x) is a series representation 
of a function that does not necessarily converge, and hence must be truncated 
at the expense of adding a remainder term. A very famous example of an 
asymptotic expansion is the Stirling approximation for the factorial, N!. The 
Stirling approximation is given by 

N-1 


1 B 
SEER e E De = 


Inn — lem + Rw), (B6) 
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where Bn are the Bernoulli numbers with 
| Bon 


< B. 
Ry < 2N (2N — 1) |z|] (Ps 
for real z. The Bernoulli numbers behave as 
2(2N)! 
Bon = (—1)"*1 UN) (2N). (B.8) 


(27)2 
The zeta function being bounded, we see that the Stirling approximation 
diverges. 

A series expansion can be obtained for the factorial of a positive integer N by 
expanding the [ function. Typically, the series expansion gives a very accurate 
approximation for the function that becomes maximally accurate after a certain 
number of terms in the expansion. At any finite truncation of the series, the 
remainder can be understood to be smaller than the subsequent term that has 
been dropped. Thus if we have a function f(x) and its asymptotic series g(x) + 
go(a) +--+ then 


F(@) — (g1 (x) + 92(@) ++: + gr-1(2)) ~ g(x) (B.9) 


for each k up to a maximum kmax which depends on zx. For larger values of £z, kmax 
increases. But after this term, the expansion starts to diverge, and it is not a good 
approximation to the original function. Thus for the Stirling approximation, for a 
given N, we should sum a finite number of terms to obtain a good approximation 
to N!, that number fixed by the value of N. However, if we look at the subsequent 
terms in the expansion, we find that they start to increase, and eventually they 
increase so much that the series fails to converge. Truncating the series at a given 
term kmax gives an approximation that is as small as the first term neglected, 
which can be very good approximation even though the asymptotic series does 
not converge. 
We use the notation 


f(x) — (gi (x) + g2(@) +-+: + gk-1(2)) = o(gn(x)), (B.10) 
which generally in physics is translated as the difference 
f(x) — (gi (x) + g2(@) +: +gk-1(2)) (B.11) 


is of the order of g(x). However, there is a precise mathematical sense to this 
relation, it means that for every positive e there exists a positive real number X 
such that, for x > X, 


f(x) — (91 (2) + g2(@) ++: + gr-1(2)) < €ge (2). (B.12) 
If f(x) =o(g(x)) and g(x) £0, then 
tim 2) i; (B.13) 


z> g(x) 
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